Lens: Rethinking Training Efficiency for Foundational Text-to-Image Models Paper • 2605.21573 • Published 11 days ago • 104
PiD: Fast and High-Resolution Latent Decoding with Pixel Diffusion Paper • 2605.23902 • Published 9 days ago • 42
From Raw Experience to Skill Consumption: A Systematic Study of Model-Generated Agent Skills Paper • 2605.23899 • Published 9 days ago • 28
SkillOpt: Executive Strategy for Self-Evolving Agent Skills Paper • 2605.23904 • Published 9 days ago • 204
Lens: Rethinking Training Efficiency for Foundational Text-to-Image Models Paper • 2605.21573 • Published 11 days ago • 104
LongLive-2.0: An NVFP4 Parallel Infrastructure for Long Video Generation Paper • 2605.18739 • Published 13 days ago • 111
InsightTok: Improving Text and Face Fidelity in Discrete Tokenization for Autoregressive Image Generation Paper • 2605.14333 • Published 17 days ago • 34
InsightTok: Improving Text and Face Fidelity in Discrete Tokenization for Autoregressive Image Generation Paper • 2605.14333 • Published 17 days ago • 34
InsightTok: Improving Text and Face Fidelity in Discrete Tokenization for Autoregressive Image Generation Paper • 2605.14333 • Published 17 days ago • 34
EfficientTrain: Exploring Generalized Curriculum Learning for Training Visual Backbones Paper • 2211.09703 • Published Nov 17, 2022
AdaFocus V2: End-to-End Training of Spatial Dynamic Networks for Video Recognition Paper • 2112.14238 • Published Dec 28, 2021
Absolute Zero: Reinforced Self-play Reasoning with Zero Data Paper • 2505.03335 • Published May 6, 2025 • 191
Co-GRPO: Co-Optimized Group Relative Policy Optimization for Masked Diffusion Model Paper • 2512.22288 • Published Dec 25, 2025 • 3
Refinement via Regeneration: Enlarging Modification Space Boosts Image Refinement in Unified Multimodal Models Paper • 2604.25636 • Published Apr 28 • 24
Steering Visual Generation in Unified Multimodal Models with Understanding Supervision Paper • 2605.05781 • Published 24 days ago • 5
Edit-Compass & EditReward-Compass: A Unified Benchmark for Image Editing and Reward Modeling Paper • 2605.13062 • Published 18 days ago • 33
Steering Visual Generation in Unified Multimodal Models with Understanding Supervision Paper • 2605.05781 • Published 24 days ago • 5
Flow-OPD: On-Policy Distillation for Flow Matching Models Paper • 2605.08063 • Published 23 days ago • 98