Classify images in real-time using labels
Transcribe and translate audio files into text
InsectSAM + GroundingDINO Inference