Tool Calling • Toward Effective Tool-Integrated Reasoning via Self-Evolved Preference Learning • Paper • 2509.23285 • Published Sep 27, 2025 • 14
JameSand/qwen3-1.7b-base-svd-muon-adam-lr3e-6-minV-bs128-kl0.0-stampede3-global_step_300 2B • Updated 21 days ago • 163
JameSand/qwen3-1.7b-base-svd-muon-adam-lr3e-6-minNone-bs128-kl0.0-stampede3-global_step_300 2B • Updated 21 days ago • 155
JameSand/qwen3-1.7b-base-svd-muon-adam-lr3e-6-minV-bs128-kl0.0-stampede3-global_step_200 2B • Updated 21 days ago • 163
JameSand/qwen3-1.7b-base-svd-muon-adam-lr3e-6-minNone-bs128-kl0.0-stampede3-global_step_200 2B • Updated 21 days ago • 162
JameSand/qwen3-1.7b-base-svd-muon-adam-ulr-3e-6-vlr-0-bs128-kl0.0-stampede3-global_step_200 2B • Updated 24 days ago • 9
JameSand/qwen3-1.7b-base-svd-muon-adam-ulr-0-vlr-3e-6-bs128-kl0.0-stampede3-global_step_200 2B • Updated 24 days ago • 11