Generate music from text prompts
Upscale images up to 4x with optional face restoration
Generate natural-sounding speech from text with consistent voice
Generate images from text prompts
Generate subtitles from audio or video files