- HallusionBench: You See What You Think? Or You Think What You See? An Image-Context Reasoning Benchmark Challenging for GPT-4V(ision), LLaVA-1.5, and Other Multi-modality Models
  Paper • 2310.14566 • Published • 27
- TouchStone: Evaluating Vision-Language Models by Language Models
  Paper • 2308.16890 • Published • 1
donghunlee
hundong2
AI & ML interests
None yet
Recent Activity
liked a model 11 days ago: nvidia/nemotron-speech-streaming-en-0.6b
upvoted a collection 16 days ago: Falcon-H1R
liked a model about 1 month ago: XiaomiMiMo/MiMo-V2-Flash
Organizations
None yet