Hugging Face
microsoft/GUI-Actor-2B-Qwen2-VL
Tags: Image-Text-to-Text, Transformers, Safetensors, qwen2_vl, conversational, text-generation-inference
arXiv: 2506.03143
License: mit
Branch: main
Total size: 4.47 GB
2 contributors
History: 12 commits
This model has 1 file scanned as unsafe.
Latest commit: qianhuiwu, "Update README.md" (8f87b36, verified), 4 months ago
File                      Size       Storage  Updated       Last commit message
.gitattributes            1.57 kB             7 months ago  Upload model weights.
README.md                 6.87 kB             4 months ago  Update README.md
added_tokens.json         537 Bytes           7 months ago  Upload model weights.
args.json                 11.7 kB             7 months ago  Upload model weights.
chat_template.json        1.05 kB             7 months ago  Upload model weights.
config.json               1.46 kB             7 months ago  Upload model weights.
generation_config.json    249 Bytes           7 months ago  Upload model weights.
merges.txt                1.67 MB             7 months ago  Upload model weights.
model.safetensors         4.45 GB    xet      7 months ago  Upload model weights.
preprocessor_config.json  498 Bytes           6 months ago  Remove `size` from processor config file to be compatible with Qwen2.5VL and transformer 4.51.3
special_tokens_map.json   965 Bytes           7 months ago  Upload model weights.
tokenizer.json            11.4 MB    xet      7 months ago  Upload model weights.
tokenizer_config.json     5.07 kB             7 months ago  Upload model weights.
trainer_state.json        342 kB              7 months ago  Upload model weights.
training_args.bin         8.31 kB    xet      7 months ago  Upload model weights.
vocab.json                2.78 MB             7 months ago  Upload model weights.