Good SFT Optimizes for SFT, Better SFT Prepares for Reinforcement Learning Paper • 2602.01058 • Published 17 days ago • 39
GutenOCR: A Grounded Vision-Language Front-End for Documents Paper • 2601.14490 • Published 28 days ago • 37
UniX: Unifying Autoregression and Diffusion for Chest X-Ray Understanding and Generation Paper • 2601.11522 • Published Jan 16 • 17
LaViT: Aligning Latent Visual Thoughts for Multi-modal Reasoning Paper • 2601.10129 • Published Jan 15 • 11
LSRIF: Logic-Structured Reinforcement Learning for Instruction Following Paper • 2601.06431 • Published Jan 10 • 12
Molmo2: Open Weights and Data for Vision-Language Models with Video Understanding and Grounding Paper • 2601.10611 • Published Jan 15 • 28
Rewarding the Rare: Uniqueness-Aware RL for Creative Problem Solving in LLMs Paper • 2601.08763 • Published Jan 13 • 148
Urban Socio-Semantic Segmentation with Vision-Language Reasoning Paper • 2601.10477 • Published Jan 15 • 155