CMI-RewardBench: Evaluating Music Reward Models with Compositional Multimodal Instruction Paper • 2603.00610 • Published 10 days ago • 32
AVMeme Exam: A Multimodal Multilingual Multicultural Benchmark for LLMs' Contextual and Cultural Knowledge and Thinking Paper • 2601.17645 • Published Jan 25 • 23
DUO-TOK: Dual-Track Semantic Music Tokenizer for Vocal-Accompaniment Generation Paper • 2511.20224 • Published Nov 25, 2025 • 1