Distribution-Aligned Sequence Distillation for Superior Long-CoT Reasoning Paper • 2601.09088 • Published 10 days ago • 57 • 6
Where Did This Sentence Come From? Tracing Provenance in LLM Reasoning Distillation Paper • 2512.20908 • Published about 1 month ago • 25
ProFit: Leveraging High-Value Signals in SFT via Probability-Guided Token Selection Paper • 2601.09195 • Published 10 days ago • 15
Alibaba-Apsara/Superior-Reasoning-SFT-gpt-oss-120b-Logprob Viewer • Updated 8 days ago • 435k • 4.11k • 49
Enhancing Linguistic Competence of Language Models through Pre-training with Language Learning Tasks Paper • 2601.03448 • Published 17 days ago • 12
Falcon-H1R: Pushing the Reasoning Frontiers with a Hybrid Model for Efficient Test-Time Scaling Paper • 2601.02346 • Published 18 days ago • 26
Can LLMs Predict Their Own Failures? Self-Awareness via Internal Circuits Paper • 2512.20578 • Published about 1 month ago • 81
Guided Self-Evolving LLMs with Minimal Human Supervision Paper • 2512.02472 • Published Dec 2, 2025 • 53
TimeBill: Time-Budgeted Inference for Large Language Models Paper • 2512.21859 • Published 29 days ago • 25