Staged post-training along the perception → reasoning capability axis. Models, datasets, paper. ICML 2026.
- UCSC-VLAA/VLM-CapCurriculum-Qwen3-VL-8B-Staged
  Image-Text-to-Text • 9B
- UCSC-VLAA/VLM-CapCurriculum-Qwen2.5-VL-7B-Staged
  Image-Text-to-Text • 8B
- UCSC-VLAA/VLM-CapCurriculum-InternVL3-8B-Staged
  Image-Text-to-Text • 8B
- UCSC-VLAA/VLM-CapCurriculum-InternVL3.5-8B-Staged
  Image-Text-to-Text • 9B