Towards Real-world Human Behavior Simulation: Benchmarking Large Language Models on Long-horizon, Cross-scenario, Heterogeneous Behavior Traces Paper • 2604.08362 • Published 5 days ago • 12
Adam's Law: Textual Frequency Law on Large Language Models Paper • 2604.02176 • Published 12 days ago • 464
GrandCode: Achieving Grandmaster Level in Competitive Programming via Agentic Reinforcement Learning Paper • 2604.02721 • Published 11 days ago • 351
VLMs Need Words: Vision Language Models Ignore Visual Detail In Favor of Semantic Anchors Paper • 2604.02486 • Published 12 days ago • 9
SQuTR: A Robustness Benchmark for Spoken Query to Text Retrieval under Acoustic Noise Paper • 2602.12783 • Published Feb 13 • 216
Video Models Reason Early: Exploiting Plan Commitment for Maze Solving Paper • 2603.30043 • Published 14 days ago • 14
Aladien/voicetune-whisper-7ec23800-1775333953 Automatic Speech Recognition • 37.8M • Updated 9 days ago • 27 • 1