Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
dipta007
/
dagger-4B_GRPO
like
0
Text Generation
Transformers
Safetensors
dipta007/dagger
dipta007/DistractMath-Bn
Bengali
English
gemma3
image-to-text
math
reasoning
computational-graph
bangla
low-resource
distractor-aware
grpo
reinforcement-learning
small-model
conversational
text-generation-inference
arxiv:
2601.06853
License:
gemma
Model card
Files
Files and versions
xet
Community
1
Deploy
Use this model
updated readme
#1
by
zabir-nabil
- opened
16 days ago
base:
refs/heads/main
←
from:
refs/pr/1
Discussion
Files changed
+64
-8
zabir-nabil
16 days ago
No description provided.
updated readme
6d8c1817
dipta007
changed pull request status to
merged
16 days ago
Edit
Preview
Upload images, audio, and videos by dragging in the text input, pasting, or
clicking here
.
Tap or paste here to upload images
Comment
·
Sign up
or
log in
to comment