Parse GUI screenshots into labeled icons and text
Extract UI components from a screenshot
Detect objects in images using text prompts
Monitor server load and status
Track file processing progress