Alibaba-Apsara/Superior-Reasoning-SFT-gpt-oss-120b
Viewer • Updated • 306k • 2.37k • 320
None defined yet.
On the Step Length Confounding in LLM Reasoning Data Selection
Distribution-Aligned Sequence Distillation for Superior Long-CoT Reasoning