Kazuki1450/Qwen3-1.7B-Base_csum_6_10_rel_1e-9_1p0_0p0_1p0_grpo_2_rule Text Generation • 2B • Updated about 21 hours ago • 32
Kazuki1450/Qwen3-1.7B-Base_csum_6_10_rel_10_1p0_0p0_1p0_grpo_2_rule Text Generation • 2B • Updated about 21 hours ago • 48
Kazuki1450/Qwen3-1.7B-Base_csum_6_10_rel_1e-7_1p0_0p0_1p0_grpo_2_rule Text Generation • 2B • Updated about 21 hours ago • 43
Kazuki1450/Qwen3-1.7B-Base_csum_6_10_rel_10_1p0_0p0_1p0_grpo_1_rule Text Generation • 2B • Updated about 21 hours ago • 51
Kazuki1450/Qwen3-1.7B-Base_csum_6_10_rel_1e-9_1p0_0p0_1p0_grpo_1_rule Text Generation • 2B • Updated about 21 hours ago • 36
Kazuki1450/Qwen3-1.7B-Base_csum_6_10_rel_1e-5_1p0_0p0_1p0_grpo_2_rule Text Generation • 2B • Updated about 21 hours ago • 33
Kazuki1450/Qwen3-1.7B-Base_csum_6_10_rel_1e-1_1p0_0p0_1p0_grpo_2_rule Text Generation • 2B • Updated about 21 hours ago • 43
Kazuki1450/Qwen3-1.7B-Base_csum_6_10_rel_1e-7_1p0_0p0_1p0_grpo_1_rule Text Generation • 2B • Updated about 21 hours ago • 39
Kazuki1450/Qwen3-1.7B-Base_csum_6_10_rel_1e-3_1p0_0p0_1p0_grpo_2_rule Text Generation • 2B • Updated about 21 hours ago • 43
Kazuki1450/Qwen3-1.7B-Base_csum_6_10_rel_1e-3_1p0_0p0_1p0_grpo_1_rule Text Generation • 2B • Updated about 21 hours ago • 50
Kazuki1450/Light-R1-SFTData-Extended-With-Difficulty-split10 Viewer • Updated Dec 4, 2025 • 5.59k • 5