DREAM: Where Visual Understanding Meets Text-to-Image Generation Paper • 2603.02667 • Published 3 days ago • 4
Think Then Embed: Generative Context Improves Multimodal Embedding Paper • 2510.05014 • Published Oct 6, 2025
StreamMem: Query-Agnostic KV Cache Memory for Streaming Video Understanding Paper • 2508.15717 • Published Aug 21, 2025 • 1
MAGE: MAsked Generative Encoder to Unify Representation Learning and Image Synthesis Paper • 2211.09117 • Published Nov 16, 2022
HaLP: Hallucinating Latent Positives for Skeleton-based Self-Supervised Learning of Actions Paper • 2304.00387 • Published Apr 1, 2023
Xray-Visual Models: Scaling Vision models on Industry Scale Data Paper • 2602.16918 • Published 15 days ago • 1