VideoSeeker: Incentivizing Instance-level Video Understanding via Native Agentic Tool Invocation Paper • 2605.16079 • Published 13 days ago • 28
Running on Zero MCP Featured 1.35k FireRed Image Edit 1.0 Fast 1.35k FireRed-Image-Edit × Qwen-Image-Edit-Rapid (Transformers)
Running on Zero MCP 107 Qwen Image Edit 2509 LoRAs Fast 107 Demo of the Collection of Qwen Image Editing LoRAs
FashionChameleon: Towards Real-Time and Interactive Human-Garment Video Customization Paper • 2605.15824 • Published 13 days ago • 62
alibaba-multimodal-industrial-ai/IndustryBench Viewer • Updated 15 days ago • 2.05k • 284 • 29
deepseek-ai/DeepSeek-V4-Flash Text Generation • 158B • Updated 22 days ago • 3.09M • 1.25k
MolmoAct2: Action Reasoning Models for Real-world Deployment Paper • 2605.02881 • Published 24 days ago • 341
ClawGUI: A Unified Framework for Training, Evaluating, and Deploying GUI Agents Paper • 2604.11784 • Published Apr 13 • 143