Cosmos3 Collection Omnimodal World Models for Physical AI • 15 items • Updated about 10 hours ago • 36
SmartDirector: Keyframe-Conditioned Cinematic Video Generation with Narrative Pacing Control Paper • 2605.27891 • Published 6 days ago • 4 • 3
From Pixels to Words -- Towards Native One-Vision Models at Scale Paper • 2605.28820 • Published 6 days ago • 68
RTDMD Collection Reinforcing Few-step Generators via Reward-Tilted Distribution Matching • 3 items • Updated 6 days ago • 2
Reinforcing Few-step Generators via Reward-Tilted Distribution Matching Paper • 2605.26108 • Published 8 days ago • 5
PiD: Fast and High-Resolution Latent Decoding with Pixel Diffusion Paper • 2605.23902 • Published 11 days ago • 45
SEGA: Spectral-Energy Guided Attention for Resolution Extrapolation in Diffusion Transformers Paper • 2605.22668 • Published 12 days ago • 40