arxiv:2505.20081
TengXiao
TTTXXX01
models 96
TTTXXX01/SFT_model
7B • Updated • 2
TTTXXX01/bce_0.1_800step
8B • Updated • 1
TTTXXX01/global_step_1100
8B • Updated • 4
TTTXXX01/global-step-920
8B • Updated • 3
TTTXXX01/DeepSeek-R1-Distill-Qwen-1.5B-GRPO
Updated
TTTXXX01/Qwen2.5-1.5B-Open-R1-GRPO
Text Generation • 2B • Updated • 4
TTTXXX01/Qwen2.5-1.5B-Open-R1-Distill
Text Generation • 2B • Updated • 4
TTTXXX01/LLama-8B-Instruct-v0.1-MI-6e-7
8B • Updated • 2
TTTXXX01/LLama-8B-Instruct-v0.1-MI-2e-5
8B • Updated • 1
TTTXXX01/LLama-8B-Instruct-v0.1-MI-5e-7
8B • Updated • 1
datasets 106
TTTXXX01/Teng_MATH_6K_Clustering
Viewer • Updated • 6k • 8
TTTXXX01/MATH-mix-6Ks-k60-spc100
Viewer • Updated • 6k • 43
TTTXXX01/DPO_Orz-30K_filtered
Viewer • Updated • 3k • 4
TTTXXX01/DPO_MathSub-30K_filtered
Viewer • Updated • 3k • 8
TTTXXX01/DPO_AceReason-Math_filtered
Viewer • Updated • 6.6k • 4
TTTXXX01/DPO_DAPO-Math-17k-Processed_filtered
Viewer • Updated • 2.58k • 42
TTTXXX01/MathSub-30K
Viewer • Updated • 9k • 11
TTTXXX01/MathSub-30K-up
Viewer • Updated • 9k • 32
TTTXXX01/diverse-semi-verifiable-tasks-o3-7500-o4-mini-high
Viewer • Updated • 10k • 3
TTTXXX01/new-wildchat-english-general
Viewer • Updated • 19k • 2