Collections
Discover the best community collections!
Collections including paper arxiv:2512.08269
-
EgoX: Egocentric Video Generation from a Single Exocentric Video
Paper • 2512.08269 • Published • 116 -
Wan-Move: Motion-controllable Video Generation via Latent Trajectory Guidance
Paper • 2512.08765 • Published • 128 -
StereoWorld: Geometry-Aware Monocular-to-Stereo Video Generation
Paper • 2512.09363 • Published • 71 -
Visionary: The World Model Carrier Built on WebGPU-Powered Gaussian Splatting Platform
Paper • 2512.08478 • Published • 76
-
Robust-R1: Degradation-Aware Reasoning for Robust Visual Understanding
Paper • 2512.17532 • Published • 65 -
The Prism Hypothesis: Harmonizing Semantic and Pixel Representations via Unified Autoencoding
Paper • 2512.19693 • Published • 62 -
SemanticGen: Video Generation in Semantic Space
Paper • 2512.20619 • Published • 89 -
EgoX: Egocentric Video Generation from a Single Exocentric Video
Paper • 2512.08269 • Published • 116
-
yandex/stable-diffusion-3.5-medium-alchemist
Text-to-Image • Updated • 3 • 6 -
Ovis-U1 Technical Report
Paper • 2506.23044 • Published • 61 -
FreeMorph: Tuning-Free Generalized Image Morphing with Diffusion Model
Paper • 2507.01953 • Published • 18 -
LongAnimation: Long Animation Generation with Dynamic Global-Local Memory
Paper • 2507.01945 • Published • 76
-
MotionShop: Zero-Shot Motion Transfer in Video Diffusion Models with Mixture of Score Guidance
Paper • 2412.05355 • Published • 8 -
SwiftEdit: Lightning Fast Text-Guided Image Editing via One-Step Diffusion
Paper • 2412.04301 • Published • 40 -
PanoDreamer: 3D Panorama Synthesis from a Single Image
Paper • 2412.04827 • Published • 10 -
Around the World in 80 Timesteps: A Generative Approach to Global Visual Geolocation
Paper • 2412.06781 • Published • 23
-
Robust-R1: Degradation-Aware Reasoning for Robust Visual Understanding
Paper • 2512.17532 • Published • 65 -
The Prism Hypothesis: Harmonizing Semantic and Pixel Representations via Unified Autoencoding
Paper • 2512.19693 • Published • 62 -
SemanticGen: Video Generation in Semantic Space
Paper • 2512.20619 • Published • 89 -
EgoX: Egocentric Video Generation from a Single Exocentric Video
Paper • 2512.08269 • Published • 116
-
yandex/stable-diffusion-3.5-medium-alchemist
Text-to-Image • Updated • 3 • 6 -
Ovis-U1 Technical Report
Paper • 2506.23044 • Published • 61 -
FreeMorph: Tuning-Free Generalized Image Morphing with Diffusion Model
Paper • 2507.01953 • Published • 18 -
LongAnimation: Long Animation Generation with Dynamic Global-Local Memory
Paper • 2507.01945 • Published • 76
-
EgoX: Egocentric Video Generation from a Single Exocentric Video
Paper • 2512.08269 • Published • 116 -
Wan-Move: Motion-controllable Video Generation via Latent Trajectory Guidance
Paper • 2512.08765 • Published • 128 -
StereoWorld: Geometry-Aware Monocular-to-Stereo Video Generation
Paper • 2512.09363 • Published • 71 -
Visionary: The World Model Carrier Built on WebGPU-Powered Gaussian Splatting Platform
Paper • 2512.08478 • Published • 76
-
MotionShop: Zero-Shot Motion Transfer in Video Diffusion Models with Mixture of Score Guidance
Paper • 2412.05355 • Published • 8 -
SwiftEdit: Lightning Fast Text-Guided Image Editing via One-Step Diffusion
Paper • 2412.04301 • Published • 40 -
PanoDreamer: 3D Panorama Synthesis from a Single Image
Paper • 2412.04827 • Published • 10 -
Around the World in 80 Timesteps: A Generative Approach to Global Visual Geolocation
Paper • 2412.06781 • Published • 23