Hiroaki OGASAWARA

xhiroga

AI & ML interests

None yet

Recent Activity

liked a model 27 days ago

deepseek-ai/DeepSeek-OCR-2

Image-Text-to-Text • Updated 22 days ago • 1.56M • 808

liked a Space about 1 month ago

Qwen3-TTS Demo

🎙

1.53k

Generate custom speech from text, voice descriptions, or samples

updated a dataset about 2 months ago

xhiroga/data

Viewer • Updated Jan 3 • 1 • 152 • 1

liked a dataset 3 months ago

Seed3D/Articulation-XL2.0

Updated Sep 19, 2025 • 194 • 29

liked a model 3 months ago

VAST-AI/UniRig

Updated Aug 1, 2025 • 75

liked a model 4 months ago

microsoft/Phi-4-multimodal-instruct

Automatic Speech Recognition • Updated Dec 10, 2025 • 335k • 1.58k

liked a Space 4 months ago

Open ASR Leaderboard

🏆

1.22k

Explore ASR model performance across languages and datasets

liked a model 4 months ago

nguyenvulebinh/AV-HuBERT-MuAViC-multilingual

Text Generation • 0.4B • Updated Mar 6, 2025 • 2 • 2

liked a model 5 months ago

meta-llama/Llama-3.2-3B

Text Generation • 3B • Updated Oct 24, 2024 • 1.27M • 700

upvoted a paper 5 months ago

Zero-AVSR: Zero-Shot Audio-Visual Speech Recognition with LLMs by Learning Language-Agnostic Speech Representations

Paper • 2503.06273 • Published Mar 8, 2025 • 6

liked a model 5 months ago

fierce-cats/beatrice-trainer

Audio-to-Audio • Updated Aug 30, 2025 • 38

updated a dataset 5 months ago

xhiroga/hiroga-speech

Updated Sep 14, 2025 • 4

published a dataset 5 months ago

xhiroga/hiroga-speech

Updated Sep 14, 2025 • 4

liked 3 models 6 months ago

liked 2 Spaces 7 months ago

LLM Embeddings Explained: A Visual and Intuitive Guide

🚀

325

How Language Models Turn Text into Meaning, From Traditional

Mitsua Likes Demo

🚀

Text-to-Image Diffusion Model trained on licensed/pd data