Running Agents 45 Automatic Speech Recognition ๐ 45 Transcribe uploaded, recorded, or online audio to text
๐ MINT-1T Collection Data for "MINT-1T: Scaling Open-Source Multimodal Data by 10x: A Multimodal Dataset with One Trillion Tokens" โข 11 items โข Updated Mar 2 โข 68
OFA-Sys/chinese-clip-vit-base-patch16 Zero-Shot Image Classification โข Updated Dec 9, 2022 โข 94.5k โข 126