InsightTok: Improving Text and Face Fidelity in Discrete Tokenization for Autoregressive Image Generation Paper • 2605.14333 • Published 8 days ago • 32
EfficientTrain: Exploring Generalized Curriculum Learning for Training Visual Backbones Paper • 2211.09703 • Published Nov 17, 2022
AdaFocus V2: End-to-End Training of Spatial Dynamic Networks for Video Recognition Paper • 2112.14238 • Published Dec 28, 2021
Absolute Zero: Reinforced Self-play Reasoning with Zero Data Paper • 2505.03335 • Published May 6, 2025 • 192
Co-GRPO: Co-Optimized Group Relative Policy Optimization for Masked Diffusion Model Paper • 2512.22288 • Published Dec 25, 2025 • 3
Refinement via Regeneration: Enlarging Modification Space Boosts Image Refinement in Unified Multimodal Models Paper • 2604.25636 • Published 24 days ago • 24
Steering Visual Generation in Unified Multimodal Models with Understanding Supervision Paper • 2605.05781 • Published 15 days ago • 4
CheXWorld: Exploring Image World Modeling for Radiograph Representation Learning Paper • 2504.13820 • Published Apr 18, 2025 • 16
EchoWorld: Learning Motion-Aware World Models for Echocardiography Probe Guidance Paper • 2504.13065 • Published Apr 17, 2025