Anti-Self-Distillation for Reasoning RL via Pointwise Mutual Information Paper • 2605.11609 • Published 12 days ago • 189
Video2GUI: Synthesizing Large-Scale Interaction Trajectories for Generalized GUI Agent Pretraining Paper • 2605.14747 • Published 10 days ago • 142
PEEK: Context Map as an Orientation Cache for Long-Context LLM Agents Paper • 2605.19932 • Published 5 days ago • 7
SQuTR: A Robustness Benchmark for Spoken Query to Text Retrieval under Acoustic Noise Paper • 2602.12783 • Published Feb 13 • 246
jina-embeddings-v5-omni: Text-Geometry-Preserving Multimodal Embeddings via Frozen-Tower Composition Paper • 2605.08384 • Published 16 days ago • 10
Mean Mode Screaming: Mean--Variance Split Residuals for 1000-Layer Diffusion Transformers Paper • 2605.06169 • Published 17 days ago • 211
A Foundation Model for Zero-Shot Logical Rule Induction Paper • 2605.04916 • Published 18 days ago • 4
Instruction-Guided Poetry Generation in Arabic and Its Dialects Paper • 2604.27766 • Published 24 days ago • 4
Heterogeneous Scientific Foundation Model Collaboration Paper • 2604.27351 • Published 24 days ago • 218
GameWorld: Towards Standardized and Verifiable Evaluation of Multimodal Game Agents Paper • 2604.07429 • Published Apr 8 • 121
Watch Before You Answer: Learning from Visually Grounded Post-Training Paper • 2604.05117 • Published Apr 6 • 36
CARLA-Air: Fly Drones Inside a CARLA World -- A Unified Infrastructure for Air-Ground Embodied Intelligence Paper • 2603.28032 • Published Mar 30 • 342