Classify images in real-time using labels
Transcribe spoken audio into written text
InsectSAM + GroundingDINO Inference