End-to-End Autoregressive Image Generation with 1D Semantic Tokenizer Paper • 2605.00503 • Published 7 days ago • 8
From Context to Skills: Can Language Models Learn from Context Skillfully? Paper • 2604.27660 • Published 5 days ago • 141
Agentic World Modeling: Foundations, Capabilities, Laws, and Beyond Paper • 2604.22748 • Published 14 days ago • 224
A Systematic Study of Cross-Modal Typographic Attacks on Audio-Visual Reasoning Paper • 2604.03995 • Published Apr 5 • 4
MARS: Enabling Autoregressive Models Multi-Token Generation Paper • 2604.07023 • Published about 1 month ago • 38
HY-Embodied-0.5: Embodied Foundation Models for Real-World Agents Paper • 2604.07430 • Published about 1 month ago • 187
CUE-R: Beyond the Final Answer in Retrieval-Augmented Generation Paper • 2604.05467 • Published Apr 7 • 7