Composition-RL: Compose Your Verifiable Prompts for Reinforcement Learning of Large Language Models Paper • 2602.12036 • Published 1 day ago • 83
GENIUS: Generative Fluid Intelligence Evaluation Suite Paper • 2602.11144 • Published 2 days ago • 52
GigaBrain-0.5M*: a VLA That Learns From World Model-Based Reinforcement Learning Paper • 2602.12099 • Published 1 day ago • 37
DeepGen 1.0: A Lightweight Unified Multimodal Model for Advancing Image Generation and Editing Paper • 2602.12205 • Published 1 day ago • 65
The Devil Behind Moltbook: Anthropic Safety is Always Vanishing in Self-Evolving AI Societies Paper • 2602.09877 • Published 4 days ago • 172
view article Article LateOn-Code & ColGrep: LightOn unveils state-of-the-art code retrieval models and code search tooling 1 day ago • 37
view article Article OpenEnv in Practice: Evaluating Tool-Using Agents in Real-World Environments +3 2 days ago • 15
PhyCritic: Multimodal Critic Models for Physical AI Paper • 2602.11124 • Published 2 days ago • 49
Internalizing Meta-Experience into Memory for Guided Reinforcement Learning in Large Language Models Paper • 2602.10224 • Published 3 days ago • 17
SkillRL: Evolving Agents via Recursive Skill-Augmented Reinforcement Learning Paper • 2602.08234 • Published 5 days ago • 64
Chain of Mindset: Reasoning with Adaptive Cognitive Modes Paper • 2602.10063 • Published 3 days ago • 69
Dr. MAS: Stable Reinforcement Learning for Multi-Agent LLM Systems Paper • 2602.08847 • Published 5 days ago • 23
Rolling Sink: Bridging Limited-Horizon Training and Open-Ended Testing in Autoregressive Video Diffusion Paper • 2602.07775 • Published 6 days ago • 7
OPUS: Towards Efficient and Principled Data Selection in Large Language Model Pre-training in Every Iteration Paper • 2602.05400 • Published 9 days ago • 305
Code2World: A GUI World Model via Renderable Code Generation Paper • 2602.09856 • Published 4 days ago • 185
Modality Gap-Driven Subspace Alignment Training Paradigm For Multimodal Large Language Models Paper • 2602.07026 • Published 12 days ago • 133
AgentCPM-Report: Interleaving Drafting and Deepening for Open-Ended Deep Research Paper • 2602.06540 • Published 8 days ago • 20
NanoQuant: Efficient Sub-1-Bit Quantization of Large Language Models Paper • 2602.06694 • Published 8 days ago • 13