OpenMOSS-Team's Collections
Game-RL
MOSS Transcribe Diarize
FRoM-W1
DiRL
RoboOmni
MOSS-Speech
MOSS-TTSD
MOSS Embodied Planner
Low Rank Sparse Attention
MHA2MLA-refactor
MHA2MLA
MOSS

MOSS Transcribe Diarize

updated 3 days ago

A unified multimodal large language model for end-to-end speaker-attributed, time-stamped transcription.

  • MOSS Transcribe Diarize: Accurate Transcription with Speaker Diarization

    Paper • 2601.01554 • Published 5 days ago • 50

  • MOSS Transcribe Diarize

    Space • Running • Featured • 34

    Transcribe audio/video files with speaker identification
