zzhang1987/Qwen2.5-VL-3B-Instruct-Open-R1-Distill
Image-Text-to-Text
Transformers
Safetensors
qwen2_5_vl
Generated from Trainer
trl
grpo
conversational
text-generation-inference
arxiv:2402.03300
main / Qwen2.5-VL-3B-Instruct-Open-R1-Distill / training_args.bin
Commit History
356a105 (verified): Training in progress, step 15. zzhang1987 committed on Mar 11, 2025
e48c76e (verified): Training in progress, step 15. zzhang1987 committed on Mar 7, 2025
4d949fa (verified): Training in progress, step 120. zzhang1987 committed on Mar 7, 2025
408e10d (verified): Training in progress, step 15. zzhang1987 committed on Mar 5, 2025
89c4782 (verified): Training in progress, step 15. zzhang1987 committed on Mar 5, 2025
6bb7993 (verified): Training in progress, step 125. zzhang1987 committed on Feb 28, 2025
b6c100a (verified): Training in progress, step 15. zzhang1987 committed on Feb 26, 2025
0292305 (verified): Training in progress, step 25. zzhang1987 committed on Feb 19, 2025
9be011a (verified): Training in progress, step 30. zzhang1987 committed on Feb 14, 2025
0c880ba (verified): Training in progress, step 100. zzhang1987 committed on Feb 13, 2025
1fdaf51 (verified): Training in progress, step 5. zzhang1987 committed on Feb 12, 2025