RefEdit: A Benchmark and Method for Improving Instruction-based Image Editing Model on Referring Expressions Paper β’ 2506.03448 β’ Published Jun 3, 2025 β’ 5
An Empirical Study of Autoregressive Pre-training from Videos Paper β’ 2501.05453 β’ Published Jan 9, 2025 β’ 41
Steering Rectified Flow Models in the Vector Field for Controlled Image Generation Paper β’ 2412.00100 β’ Published Nov 27, 2024 β’ 17
TripletCLIP: Improving Compositional Reasoning of CLIP via Synthetic Vision-Language Negatives Paper β’ 2411.02545 β’ Published Nov 4, 2024 β’ 1
Benchmark Checkpoints Collection Weights of the TripletCLIP and baselines on custom training scripts β’ 9 items β’ Updated Dec 1, 2024 β’ 2
FlowChef Collection Steering Rectified Flow Models in the Vector Field for Controlled Image Generation β’ 3 items β’ Updated Nov 30, 2024 β’ 1
Guide-and-Rescale: Self-Guidance Mechanism for Effective Tuning-Free Real Image Editing Paper β’ 2409.01322 β’ Published Sep 2, 2024 β’ 96
Transformer Explainer: Interactive Learning of Text-Generative Models Paper β’ 2408.04619 β’ Published Aug 8, 2024 β’ 175
TurboEdit: Text-Based Image Editing Using Few-Step Diffusion Models Paper β’ 2408.00735 β’ Published Aug 1, 2024 β’ 16
SF3D: Stable Fast 3D Mesh Reconstruction with UV-unwrapping and Illumination Disentanglement Paper β’ 2408.00653 β’ Published Aug 1, 2024 β’ 31
Tora: Trajectory-oriented Diffusion Transformer for Video Generation Paper β’ 2407.21705 β’ Published Jul 31, 2024 β’ 27
FreeLong: Training-Free Long Video Generation with SpectralBlend Temporal Attention Paper β’ 2407.19918 β’ Published Jul 29, 2024 β’ 51
An Image is Worth 32 Tokens for Reconstruction and Generation Paper β’ 2406.07550 β’ Published Jun 11, 2024 β’ 60
Zero-shot Image Editing with Reference Imitation Paper β’ 2406.07547 β’ Published Jun 11, 2024 β’ 33
Bigger is not Always Better: Scaling Properties of Latent Diffusion Models Paper β’ 2404.01367 β’ Published Apr 1, 2024 β’ 22
Representative Papers Collection Collection of research papers published by the organization members β’ 4 items β’ Updated Mar 30, 2024 β’ 1
ECLIPSE Series Priors Collection ECLIPSE priors for kandinsky v2.2 for T2I and Personalized T2I. β’ 3 items β’ Updated Apr 12, 2024 β’ 1
Magic-Me: Identity-Specific Video Customized Diffusion Paper β’ 2402.09368 β’ Published Feb 14, 2024 β’ 31