Arghya Bhattacharya

orgh0

AI & ML interests

Natural Language Processing

Recent Activity

reacted to kavyamanohar's post with 🚀 about 5 hours ago

Post

Releasing Vividh-ASR — an open benchmark and models for Hindi and Malayalam ASR.

Vividh-ASR is built from public data, stratified by complexity:
→ Clean recordings
→ Noisy and accented speech
→ Spontaneous, conversational audio

Alongside the benchmark, we release:
→ Open models for Hindi and Malayalam
→ A training recipe with two counterintuitive choices that moved the needle
→ What failed, not just what worked

The stratified evaluation methodology transfers directly to any low-resource language setup — beyond Hindi and Malayalam.

Built at @adalatai, where we build speech tech for Indian courts. This is our first open contribution back to the community. @janaab @Kush0610 @orgh0

Link: https://huggingface.co/blog/adalat-ai/vividh-benchmark

upvoted an article 5 days ago

Article

Vividh-ASR: Diagnosing and Fixing Studio-Bias in Whisper for Indic Languages

adalat-ai • 5 days ago • 11

updated a dataset over 1 year ago

orgh0/openwhisper

Viewer • Updated Feb 13, 2025 • 435k • 8

published a dataset over 1 year ago

orgh0/openwhisper

Viewer • Updated Feb 13, 2025 • 435k • 8
