Repo for paper Rethinking Generalization in Reasoning SFT: A Conditional Analysis on Optimization, Data, and Model Capability.
Qihan Ren
jasonrqh
AI & ML interests
explainable AI, LLM
Recent Activity
authored a paper about 4 hours ago
Code2Math: Can Your Code Agent Effectively Evolve Math Problems Through Exploration? authored a paper about 5 hours ago
ATBench: A Diverse and Realistic Trajectory Benchmark for Long-Horizon Agent Safety authored a paper about 5 hours ago
Rethinking Generalization in Reasoning SFT: A Conditional Analysis on Optimization, Data, and Model Capability