Black-Box On-Policy Distillation of Large Language Models • Paper • 2511.10643 • Published Nov 13, 2025 • 48
Benefits and Pitfalls of Reinforcement Learning for Language Model Planning: A Theoretical Perspective • Paper • 2509.22613 • Published Sep 26, 2025 • 9
DocReward: A Document Reward Model for Structuring and Stylizing • Paper • 2510.11391 • Published Oct 13, 2025 • 27
Information-Preserving Reformulation of Reasoning Traces for Antidistillation • Paper • 2510.11545 • Published Oct 13, 2025 • 1
Latent Sketchpad: Sketching Visual Thoughts to Elicit Multimodal Reasoning in MLLMs • Paper • 2510.24514 • Published Oct 28, 2025 • 21
The Era of Agentic Organization: Learning to Organize with Language Models • Paper • 2510.26658 • Published Oct 30, 2025 • 27
SeerAttention-R: Sparse Attention Adaptation for Long Reasoning • Paper • 2506.08889 • Published Jun 10, 2025 • 23
Model as a Game: On Numerical and Spatial Consistency for Generative Games • Paper • 2503.21172 • Published Mar 27, 2025
Think Only When You Need with Large Hybrid-Reasoning Models • Paper • 2505.14631 • Published May 20, 2025 • 20
Imagine while Reasoning in Space: Multimodal Visualization-of-Thought • Paper • 2501.07542 • Published Jan 13, 2025 • 3
Zero-shot Cross-lingual Transfer of Neural Machine Translation with Multilingual Pretrained Encoders • Paper • 2104.08757 • Published Apr 18, 2021
WildLong: Synthesizing Realistic Long-Context Instruction Data at Scale • Paper • 2502.16684 • Published Feb 23, 2025 • 1