GGT-100K: Generative Ground Truth for Generalizable Real-World Image Restoration Paper • 2605.31039 • Published 7 days ago • 41
LocateAnything: Fast and High-Quality Vision-Language Grounding with Parallel Box Decoding Paper • 2605.27365 • Published 10 days ago • 138
Running on Zero Agents Featured 2.53k Qwen Image Multiple Angles 3D Camera 🎥 2.53k Transform image viewpoint with adjustable camera angles
JarvisEvo: Towards a Self-Evolving Photo Editing Agent with Synergistic Editor-Evaluator Optimization Paper • 2511.23002 • Published Nov 28, 2025 • 26
GLM-4.1V-Thinking: Towards Versatile Multimodal Reasoning with Scalable Reinforcement Learning Paper • 2507.01006 • Published Jul 1, 2025 • 256