MISA: Mixture of Indexer Sparse Attention for Long-Context LLM Inference Paper • 2605.07363 • Published 4 days ago • 12
Can RL Teach Long-Horizon Reasoning to LLMs? Expressiveness Is Key Paper • 2605.06638 • Published 5 days ago • 13
AcademiClaw: When Students Set Challenges for AI Agents Paper • 2605.02661 • Published 8 days ago • 15
D-OPSD: On-Policy Self-Distillation for Continuously Tuning Step-Distilled Diffusion Models Paper • 2605.05204 • Published 6 days ago • 24
SkillOS: Learning Skill Curation for Self-Evolving Agents Paper • 2605.06614 • Published 5 days ago • 37
LLMs Improving LLMs: Agentic Discovery for Test-Time Scaling Paper • 2605.08083 • Published 4 days ago • 57
Skill1: Unified Evolution of Skill-Augmented Agents via Reinforcement Learning Paper • 2605.06130 • Published 5 days ago • 90
Flow-OPD: On-Policy Distillation for Flow Matching Models Paper • 2605.08063 • Published 4 days ago • 81
ARIS: Autonomous Research via Adversarial Multi-Agent Collaboration Paper • 2605.03042 • Published 8 days ago • 107
Heterogeneous Scientific Foundation Model Collaboration Paper • 2604.27351 • Published 12 days ago • 211
view article Article Tropical Quivers for Modern AI: A Guided Tour of a Research Program AmelieSchreiber • Mar 22 • 3
view article Article Fine-tune Llama 3.1 Ultra-Efficiently with Unsloth mlabonne • Jul 29, 2024 • 371
S0 Tuning: Zero-Overhead Adaptation of Hybrid Recurrent-Attention Models Paper • 2604.01168 • Published Apr 1 • 7
ThinkTwice: Jointly Optimizing Large Language Models for Reasoning and Self-Refinement Paper • 2604.01591 • Published Apr 2 • 42
How Well Do Agentic Skills Work in the Wild: Benchmarking LLM Skill Usage in Realistic Settings Paper • 2604.04323 • Published Apr 6 • 41