Towards the Aha Moment of Vision-Language Models
AI & ML interests
None defined yet.
Recent Activity
View all activity
models 9
MMInstruction/Qwen2-VL-72B-Video-T3
73B • Updated • 5
MMInstruction/Giraffe
8B • Updated • 5 • 2
MMInstruction/LongVA-7B-Video-T3
8B • Updated • 3
MMInstruction/Qwen-VL-ArXivCap
Text Generation • Updated • 363 • 4
MMInstruction/Qwen-VL-ArXivQA
Text Generation • Updated • 367 • 4
MMInstruction/Silkie
Text Generation • Updated • 377 • 12
MMInstruction/YingVLM
Updated • 32 • 1
MMInstruction/YingVLM-zh
Updated • 28
MMInstruction/YingVLM-Video
Updated • 28
datasets 17
MMInstruction/stock_factors
Viewer • Updated • 48.2M • 3.02k • 2
MMInstruction/OSWorld-G
Viewer • Updated • 510 • 494 • 6
MMInstruction/VL-RewardBench
Viewer • Updated • 1.25k • 771 • 15
MMInstruction/Video-T3-QA
Viewer • Updated • 162k • 249 • 2
MMInstruction/SuperClevr_Val
Viewer • Updated • 5k • 50 • 1
MMInstruction/Clevr_CoGenT_TrainA_R1
Viewer • Updated • 37.8k • 314 • 48
MMInstruction/Clevr_CoGenT_TrainA_70K_Complex
Viewer • Updated • 70k • 2.03k • 8
MMInstruction/Clevr_CoGenT_ValB
Viewer • Updated • 5k • 31 • 2
MMInstruction/Clevr_CoGenT_ValA
Viewer • Updated • 5k • 301 • 1
MMInstruction/Clevr_CoAgent_TrainA_R1
Viewer • Updated • 2.5k • 83