Hiroto N. PRO

hironow

AI & ML interests

AI Agent, LLM, Audio, Animate

Recent Activity

liked a model 43 minutes ago

nvidia/nemotron-speech-streaming-en-0.6b

upvoted an article 43 minutes ago

Scaling Real-Time Voice Agents with Cache-Aware Streaming ASR

liked a model 6 days ago

zai-org/SCAIL-Preview

View all activity

Organizations

upvoted an article 43 minutes ago

Article

Scaling Real-Time Voice Agents with Cache-Aware Streaming ASR

3 days ago

•

upvoted an article 7 days ago

Article

Nemotron-Personas-Japan: ソブリン AI のための合成データセット

Sep 26, 2025

•

upvoted a paper about 1 month ago

Open ASR Leaderboard: Towards Reproducible and Transparent Multilingual and Long-Form Speech Recognition Evaluation

Paper • 2510.06961 • Published Oct 8, 2025 • 10

upvoted a collection about 1 month ago

ShieldGemma Release

Collection

A series of safety classifiers, trained on top of Gemma 2, for developers to filter inputs and outputs of their applications. • 3 items • Updated Jul 10, 2025 • 15

upvoted an article about 1 month ago

Article

Introducing SynthID Text

Oct 23, 2024

•

upvoted a paper about 2 months ago

Wan-Animate: Unified Character Animation and Replacement with Holistic Replication

Paper • 2509.14055 • Published Sep 17, 2025 • 17

upvoted 2 articles about 2 months ago

Article

Training Flux Locally on Mac

Sep 12, 2024

•

Article

Make your ZeroGPU Spaces go brrr with ahead-of-time compilation

Sep 2, 2025

•

upvoted an article 3 months ago

Article

Nemotron-Personas-Japan: Synthesized Data for Sovereign AI

Sep 23, 2025

•

upvoted a collection 4 months ago

Mem-Agent

Collection

Small sized agents from Dria trained on interacting with an obsidian-like memory system using python tools. Trained on Qwen3-4B-Thinking-2507. • 4 items • Updated Sep 5, 2025 • 4

upvoted an article 4 months ago

Article

mem-agent: Persistent, Human Readable Memory Agent Trained with Online RL

Sep 11, 2025

•

upvoted 6 articles 5 months ago

Article

Introducing HELMET: Holistically Evaluating Long-context Language Models

Apr 16, 2025

•

Article

17 Reasons Why Gradio Isn't Just Another UI Library

Apr 16, 2025

•

Article

Tiny Agents in Python: a MCP-powered agent in ~70 lines of code

May 23, 2025

•

170

Article

ScreenSuite - The most comprehensive evaluation suite for GUI Agents!

Jun 6, 2025

•

Article

ScreenEnv: Deploy your full stack Desktop Agent

Jul 10, 2025

•

Article

Welcome GPT OSS, the new open-source model family from OpenAI!

Aug 5, 2025

•

508

upvoted a paper 6 months ago

MemOS: A Memory OS for AI System

Paper • 2507.03724 • Published Jul 4, 2025 • 157

upvoted an article 8 months ago

Article

How to Build an MCP Server with Gradio

Apr 30, 2025

•

201

upvoted an article 9 months ago

Article

FastRTC: The Real-Time Communication Library for Python

Feb 25, 2025

•

172

Hiroto N. PRO

AI & ML interests

Recent Activity

Organizations

hironow's activity

Scaling Real-Time Voice Agents with Cache-Aware Streaming ASR

Nemotron-Personas-Japan: ソブリン AI のための合成データセット

Introducing SynthID Text

Training Flux Locally on Mac

Make your ZeroGPU Spaces go brrr with ahead-of-time compilation

Nemotron-Personas-Japan: Synthesized Data for Sovereign AI

mem-agent: Persistent, Human Readable Memory Agent Trained with Online RL

Introducing HELMET: Holistically Evaluating Long-context Language Models

17 Reasons Why Gradio Isn't Just Another UI Library

Tiny Agents in Python: a MCP-powered agent in ~70 lines of code

ScreenSuite - The most comprehensive evaluation suite for GUI Agents!

ScreenEnv: Deploy your full stack Desktop Agent

Welcome GPT OSS, the new open-source model family from OpenAI!

How to Build an MCP Server with Gradio

FastRTC: The Real-Time Communication Library for Python