Humanoid-GPT: Scaling Data and Structure for Zero-Shot Motion Tracking Paper • 2606.03985 • Published 3 days ago • 37
VLMs are Good Teachers for Video Reasoning via Adaptive Test-Time Optimization Paper • 2606.02564 • Published 4 days ago • 29
General-Instinct/InstinctRazor-Qwen3.5-122B-A10B-GGUF Text Generation • 122B • Updated about 21 hours ago • 306 • 11
Where Do Deep-Research Agents Go Wrong? Span-Level Error Localization in Agent Trajectories Paper • 2606.02060 • Published 4 days ago • 38
LongTraceRL: Learning Long-Context Reasoning from Search Agent Trajectories with Rubric Rewards Paper • 2605.31584 • Published 7 days ago • 41
Representation Forcing for Bottleneck-Free Unified Multimodal Models Paper • 2605.31604 • Published 7 days ago • 56
Beyond Semantic Similarity: Rethinking Retrieval for Agentic Search via Direct Corpus Interaction Paper • 2605.05242 • Published May 3 • 120
Running 178 The ultimate guide to RL environments: building and scaling them in the LLM era 📝 178 Building and scaling RL environments for LLM training
A Matter of TASTE: Improving Coverage and Difficulty of Agent Benchmarks Paper • 2605.28556 • Published 9 days ago • 61
SwanVoice: Expressive Long-Form Zero-Shot Speech Synthesis for Both Monologue and Dialogue Paper • 2605.30993 • Published 7 days ago • 56
HauhauCS/Qwen3.6-35B-A3B-Uncensored-HauhauCS-Aggressive Image-Text-to-Text • 35B • Updated Apr 17 • 2.65M • 1.38k
SWE-chat: Coding Agent Interactions From Real Users in the Wild Paper • 2604.20779 • Published Apr 22 • 16