Yifan Peng

pyf98

https://pyf98.github.io

pyf98
yifan-peng

AI & ML interests

Multimodal LLMs, Speech-to-Speech, Speech Recognition

pyf98's collections (1)

Open Whisper-style Speech Models (OWSM)

Fully open Whisper-style speech foundation models developed by CMU WAVLab: https://www.wavlab.org/activities/2024/owsm/

OWSM V4 Demo (Space, 10 likes; status shown: Configuration error)
This is a demo for OWSM-V4 CTC and medium model.

OWSM Demo (Space, featured, 55 likes; status shown: Runtime error)

espnet/yodas_owsmv4

Viewer • Updated Sep 1, 2025 • 4 • 5.27k • 17
espnet/owsm_ctc_v4_1B

Automatic Speech Recognition • Updated Apr 7 • 11.8k • 13

