FluidInference/parakeet-realtime-eou-120m-coreml Automatic Speech Recognition β’ Updated Mar 14 β’ 16.4k β’ 4
microsoft/Phi-4-multimodal-instruct Automatic Speech Recognition β’ 6B β’ Updated Dec 10, 2025 β’ 357k β’ 1.59k
HuggingFaceTB/SmolVLM2-500M-Video-Instruct Image-Text-to-Text β’ Updated Apr 8, 2025 β’ 394k β’ 133
openai/clip-vit-large-patch14 Zero-Shot Image Classification β’ 0.4B β’ Updated Sep 15, 2023 β’ 21M β’ 2k
Runtime error Agents Featured 272 Edit Video By Editing Text β 272 Audio-based video editing using AI-generated transcription