Open ASR Leaderboard: Towards Reproducible and Transparent Multilingual and Long-Form Speech Recognition Evaluation Paper • 2510.06961 • Published Oct 8, 2025 • 10
ShieldGemma Release Collection A series of safety classifiers, trained on top of Gemma 2, for developers to filter inputs and outputs of their applications. • 3 items • Updated Jul 10, 2025 • 15
Wan-Animate: Unified Character Animation and Replacement with Holistic Replication Paper • 2509.14055 • Published Sep 17, 2025 • 17
view article Article Make your ZeroGPU Spaces go brrr with ahead-of-time compilation +2 Sep 2, 2025 • 71
Mem-Agent Collection Small sized agents from Dria trained on interacting with an obsidian-like memory system using python tools. Trained on Qwen3-4B-Thinking-2507. • 4 items • Updated Sep 5, 2025 • 4
view article Article mem-agent: Persistent, Human Readable Memory Agent Trained with Online RL Sep 11, 2025 • 25
view article Article Introducing HELMET: Holistically Evaluating Long-context Language Models +5 Apr 16, 2025 • 40
view article Article Tiny Agents in Python: a MCP-powered agent in ~70 lines of code +2 May 23, 2025 • 170
view article Article ScreenSuite - The most comprehensive evaluation suite for GUI Agents! +1 Jun 6, 2025 • 55
view article Article Welcome GPT OSS, the new open-source model family from OpenAI! +10 Aug 5, 2025 • 508
Aguvis: Unified Pure Vision Agents for Autonomous GUI Interaction Paper • 2412.04454 • Published Dec 5, 2024 • 71