MIT/ast-finetuned-audioset-10-10-0.4593
Audio Classification β’ 86.6M β’ Updated
β’ 449k β’ 340
Zero Shot voice cloning with llasa 3b (Unofficial Demo)
Generate music from lyrics and genre tags
Generate speech audio from text with voice and emotion tweaks
Generate audio from text, video, or audio prompts
Generate expressive speech from text and voice reference
Music Generation Foundation Model v1.5