TimeChat-Captioner: Scripting Multi-Scene Videos with Time-Aware and Structural Audio-Visual Captions • Paper • 2602.08711 • Published 4 days ago • 25
SkillRL: Evolving Agents via Recursive Skill-Augmented Reinforcement Learning • Paper • 2602.08234 • Published 4 days ago • 64
Context Forcing: Consistent Autoregressive Video Generation with Long Context • Paper • 2602.06028 • Published 8 days ago • 35
Efficient Autoregressive Video Diffusion with Dummy Head • Paper • 2601.20499 • Published 16 days ago • 8
Unified Personalized Reward Model for Vision Generation • Paper • 2602.02380 • Published 11 days ago • 19
3D-Aware Implicit Motion Control for View-Adaptive Human Video Generation • Paper • 2602.03796 • Published 10 days ago • 56
NativeTok: Native Visual Tokenization for Improved Image Generation • Paper • 2601.22837 • Published 14 days ago • 9
DreamActor-M2: Universal Character Image Animation via Spatiotemporal In-Context Learning • Paper • 2601.21716 • Published 15 days ago • 13