Generate speaker‑labeled transcript from an audio file
Reasoning model specialized for OCR/Markdown generation.