Kazuki1450/Qwen2.5-1.5B-Instruct_dsum_3_6_tok_python_1p0_0p0_1p0_grpo_42_rule Updated about 17 hours ago
Kazuki1450/Qwen2.5-1.5B-Instruct_dsum_3_6_tok_Certainly_1p0_0p0_1p0_grpo_42_rule Updated about 17 hours ago
Kazuki1450/Olmo-3-1025-7B_dsum_3_6_fnr_no_bracket_0p0_0p0_1p0_grpo_dr_grpo_42_rule Updated 2 days ago
Kazuki1450/Olmo-3-1025-7B_dsum_3_6_fnr_with_bracket_1p0_0p0_1p0_grpo_dr_grpo_42_rule Updated 2 days ago
Kazuki1450/Qwen3-1.7B-Base_dsum_3_6_fnr_with_bracket_1p0_0p0_1p0_grpo_sapo_42_rule Updated 2 days ago
Kazuki1450/Qwen3-1.7B-Base_dsum_3_6_fnr_no_bracket_0p0_0p0_1p0_grpo_dr_grpo_42_rule Updated 2 days ago
Kazuki1450/Olmo-3-1025-7B_dsum_3_6_0p5_0p0_1p0_grpo_dr_grpo_42_rule Text Generation • 7B • Updated 2 days ago • 419
Kazuki1450/Qwen3-1.7B-Base_dsum_3_6_0p5_0p0_1p0_grpo_sapo_42_rule Text Generation • 2B • Updated 2 days ago • 464
Kazuki1450/Qwen3-1.7B-Base_dsum_3_6_fnr_with_bracket_1p0_0p0_1p0_grpo_dr_grpo_42_rule Text Generation • 2B • Updated 2 days ago • 258
Kazuki1450/Qwen3-1.7B-Base_dsum_3_6_0p8_0p0_1p0_grpo_42_rule Text Generation • 2B • Updated 2 days ago • 400
Kazuki1450/Olmo-3-1025-7B_dsum_3_6_0p5_0p0_1p0_grpo_42_rule Text Generation • 7B • Updated 2 days ago • 385
Kazuki1450/Qwen3-1.7B-Base_dsum_3_6_0p8_0p0_1p0_grpo_dr_grpo_42_rule Text Generation • 2B • Updated 2 days ago • 366
Kazuki1450/Olmo-3-1025-7B_dsum_3_6_fnr_eng_1p0_0p0_1p0_grpo_42_rule Text Generation • 7B • Updated 2 days ago • 770