RLVR Model and data for 'Expanding RL with Verifiable Rewards Across Diverse Domains' virtuoussy/Qwen2.5-7B-Instruct-RLVR 8B • Updated May 4, 2025 • 50 • 17 virtuoussy/Math-RLVR Viewer • Updated Apr 16, 2025 • 782k • 102 • 9 virtuoussy/Multi-subject-RLVR Viewer • Updated Apr 16, 2025 • 579k • 143 • 66
RLVR Model and data for 'Expanding RL with Verifiable Rewards Across Diverse Domains' virtuoussy/Qwen2.5-7B-Instruct-RLVR 8B • Updated May 4, 2025 • 50 • 17 virtuoussy/Math-RLVR Viewer • Updated Apr 16, 2025 • 782k • 102 • 9 virtuoussy/Multi-subject-RLVR Viewer • Updated Apr 16, 2025 • 579k • 143 • 66