-
Open ASR Leaderboard configuration for Transformers π€ models
πDisplay the serverβs root folder as a web page
-
Open ASR Leaderboard configuration for NVIDIA NeMo ASR models
πNormalize text to a consistent, clean format
-
Open ASR Leaderboard configuration for Boson's Higgs Audio v3
πNormalize and clean text data for analysis
-
Open ASR Leaderboard configuration for API models
πRun evaluations and benchmark your ML models
AI & ML interests
None defined yet.
Recent Activity
View all activity
Papers
View all PapersA collection of ASR models supported in π€ Transformers
-
openai/whisper-large-v2
Automatic Speech Recognition β’ 2B β’ Updated β’ 61.7k β’ 1.8k -
facebook/wav2vec2-base-960h
Automatic Speech Recognition β’ 94.4M β’ Updated β’ 1.09M β’ 398 -
facebook/wav2vec2-large-xlsr-53
Updated β’ 177k β’ 159 -
facebook/hubert-xlarge-ls960-ft
Automatic Speech Recognition β’ 1.0B β’ Updated β’ 14.5k β’ 16
A collection of audio classification models supported in π€ Transformers
A collection of codec and embedding models supported in π€ Transformers.
-
laion/clap-htsat-unfused
Feature Extraction β’ Updated β’ 494k β’ β’ 74 -
facebook/encodec_32khz
Feature Extraction β’ 59M β’ Updated β’ 45.3k β’ 18 -
descript/dac_44khz
Feature Extraction β’ 76.6M β’ Updated β’ 88.7k β’ β’ 11 -
descript/dac_24khz
Feature Extraction β’ 74.7M β’ Updated β’ 3.04k β’ β’ 3
Transformer supported versions of X-Codec models: https://github.com/zhenye234/xcodec?tab=readme-ov-file#available-models
-
hf-audio/xcodec-hubert-general-balanced
Feature Extraction β’ 0.2B β’ Updated β’ 1.01k β’ 1 -
hf-audio/xcodec-wavlm-more-data
Feature Extraction β’ 0.2B β’ Updated β’ 1.4k β’ 1 -
hf-audio/xcodec-wavlm-mls
Feature Extraction β’ 0.2B β’ Updated β’ 954 -
hf-audio/xcodec-hubert-general
Feature Extraction β’ 0.2B β’ Updated β’ 4.09k
A collection of TTS models supported in π€ Transformers.
A collection of music generation models supported in π€ Transformers and 𧨠Diffusers
-
Open ASR Leaderboard configuration for Transformers π€ models
πDisplay the serverβs root folder as a web page
-
Open ASR Leaderboard configuration for NVIDIA NeMo ASR models
πNormalize text to a consistent, clean format
-
Open ASR Leaderboard configuration for Boson's Higgs Audio v3
πNormalize and clean text data for analysis
-
Open ASR Leaderboard configuration for API models
πRun evaluations and benchmark your ML models
Transformer supported versions of X-Codec models: https://github.com/zhenye234/xcodec?tab=readme-ov-file#available-models
-
hf-audio/xcodec-hubert-general-balanced
Feature Extraction β’ 0.2B β’ Updated β’ 1.01k β’ 1 -
hf-audio/xcodec-wavlm-more-data
Feature Extraction β’ 0.2B β’ Updated β’ 1.4k β’ 1 -
hf-audio/xcodec-wavlm-mls
Feature Extraction β’ 0.2B β’ Updated β’ 954 -
hf-audio/xcodec-hubert-general
Feature Extraction β’ 0.2B β’ Updated β’ 4.09k
A collection of ASR models supported in π€ Transformers
-
openai/whisper-large-v2
Automatic Speech Recognition β’ 2B β’ Updated β’ 61.7k β’ 1.8k -
facebook/wav2vec2-base-960h
Automatic Speech Recognition β’ 94.4M β’ Updated β’ 1.09M β’ 398 -
facebook/wav2vec2-large-xlsr-53
Updated β’ 177k β’ 159 -
facebook/hubert-xlarge-ls960-ft
Automatic Speech Recognition β’ 1.0B β’ Updated β’ 14.5k β’ 16
A collection of TTS models supported in π€ Transformers.
A collection of audio classification models supported in π€ Transformers
A collection of music generation models supported in π€ Transformers and 𧨠Diffusers
A collection of codec and embedding models supported in π€ Transformers.
-
laion/clap-htsat-unfused
Feature Extraction β’ Updated β’ 494k β’ β’ 74 -
facebook/encodec_32khz
Feature Extraction β’ 59M β’ Updated β’ 45.3k β’ 18 -
descript/dac_44khz
Feature Extraction β’ 76.6M β’ Updated β’ 88.7k β’ β’ 11 -
descript/dac_24khz
Feature Extraction β’ 74.7M β’ Updated β’ 3.04k β’ β’ 3