PAIWorld: A 3D-Consistent World Foundation Model for Robotic Manipulation Paper • 2606.18375 • Published 7 days ago • 11
MaineCoon: Pursuing A Real-Time Audio-Visual Social World Model Paper • 2606.17800 • Published 7 days ago • 13
OmniDirector: General Multi-Shot Camera Cloning without Cross-Paired Data Paper • 2606.13432 • Published 12 days ago • 106
Avatar V: Scaling Video-Reference Avatar Video Generation Paper • 2606.13872 • Published 12 days ago • 9
Qwen-RobotWorld Technical Report: Unifying Embodied World Modeling through Language-Conditioned Video Generation Paper • 2606.17030 • Published 8 days ago • 28
PermaVid: Consistent Video Generation Across Edits via Disentangled Context Memory Paper • 2606.16449 • Published 8 days ago • 5
Beyond Monolingual Deep Research: Evaluating Agents and Retrievers with Cross-Lingual BrowseComp-Plus Paper • 2606.15345 • Published 10 days ago • 16
ActWorld: From Explorable to Interactive World Model via Action-Aware Memory Paper • 2606.17730 • Published 7 days ago • 8
iMaC: Translating Actions into Motion and Contact Images for Embodied World Models Paper • 2606.09813 • Published 15 days ago • 13
MBench: A Comprehensive Benchmark on Memory Capability for Video World Models Paper • 2606.00793 • Published 15 days ago • 11
World Pilot: Steering Vision-Language-Action Models with World-Action Priors Paper • 2606.12403 • Published 13 days ago • 26
EgoCS-400K: An Egocentric Gameplay Dataset for World Models Paper • 2606.18180 • Published 7 days ago • 15
DreamX-World 1.0: A General-Purpose Interactive World Model Paper • 2606.16993 • Published 8 days ago • 108
BRDFusion: Physics Meets Generation for Urban Scene Inverse Rendering Paper • 2606.17049 • Published 8 days ago • 27