Running Featured 161 SmolVLM realtime WebGPU ⚡ 161 Ask questions about your webcam view and get text answers
microsoft/Phi-4-multimodal-instruct Automatic Speech Recognition • 6B • Updated Dec 10, 2025 • 538k • 1.6k
meta-llama/Llama-3.2-11B-Vision-Instruct Image-Text-to-Text • 11B • Updated Dec 4, 2024 • 181k • 1.6k