openai/gsm8k
Benchmark • Updated • 17.6k • 933k • 1.32k
A math reasoning model achieving 84.84% on GSM8K through preference optimization.
| Model | GSM8K Accuracy |
|---|---|
| Abel-7B-002 (base) | 79.08% |
| + SFT | 84.46% |
| + DPO | 84.84% |
Fine-tuned with LoRA adapters using a two-stage approach:
from transformers import AutoModelForCausalLM, AutoTokenizer
model = AutoModelForCausalLM.from_pretrained("dylxnmyl/DylanDeep-Core-8B-DPO")
tokenizer = AutoTokenizer.from_pretrained("dylxnmyl/DylanDeep-Core-8B-DPO")
License
This model is released under CC BY-NC-ND 4.0 with the following conditions:
Non-commercial use only
No derivatives without permission
Attribution required
Additionally, this model inherits the LLaMA 2 Community License from its base model. Users must comply with both licenses. MODELCARD