Kazuki1450/Qwen3-1.7B-Base_csum_6_10_clean_huge_1p0_0p0_1p0_grpo_42_rule Text Generation • 2B • Updated about 6 hours ago
Kazuki1450/Qwen3-1.7B-Base_csum_6_10_stepbystep_0p5_1p0_0p0_1p0_grpo_42_rule Text Generation • 2B • Updated 2 days ago • 22
Kazuki1450/Qwen3-1.7B-Base_csum_6_10_stepbystep_S_1p0_0p0_1p0_grpo_42_rule Text Generation • 2B • Updated 30 days ago • 2
Kazuki1450/Qwen3-1.7B-Base_csum_6_10_python_1p0_0p0_1p0_grpo_42_rule Text Generation • 2B • Updated 30 days ago • 3
Kazuki1450/Qwen3-1.7B-Base_csum_6_10_Certainly_1p0_0p0_1p0_grpo_42_rule Text Generation • 2B • Updated 30 days ago • 3
Kazuki1450/Qwen3-1.7B-Base_csum_6_10_difficult_1p0_0p0_1p0_grpo_42_rule Text Generation • 2B • Updated 30 days ago • 2
Kazuki1450/Qwen3-1.7B-Base_csum_6_10_stepbystep_1p0_0p0_1p0_grpo_42_rule Text Generation • 2B • Updated 30 days ago • 3
Kazuki1450/Qwen3-1.7B-Base_csum_6_10_Therefore_1p0_0p0_1p0_grpo_42_rule Text Generation • 2B • Updated 30 days ago • 2
Kazuki1450/Qwen3-1.7B-Base_csum_6_10_he_1p0_0p0_1p0_grpo_42_rule Text Generation • 2B • Updated 30 days ago • 2
Kazuki1450/Qwen3-1.7B-Base_csum_6_10_mazu_1p0_0p0_1p0_grpo_42_rule Text Generation • 2B • Updated 30 days ago • 5
Kazuki1450/Light-R1-SFTData-Extended-With-Difficulty-split10 Viewer • Updated Dec 4, 2025 • 5.59k • 8