ch-min/Qwen2.5-VL-3B-Instruct-data_scale_exp_800k-20251114_120221 Image-Text-to-Text • 4B • Updated 1 day ago • 37
ch-min/Qwen2.5-VL-3B-Instruct-data_scale_exp_400k-20251114_120221 Image-Text-to-Text • 4B • Updated 1 day ago • 35
ch-min/Qwen2.5-VL-3B-Instruct-data_scale_exp_2m-20260109_120517 Image-Text-to-Text • 4B • Updated 1 day ago • 42
ch-min/Qwen2.5-VL-3B-Instruct-data_scale_exp_80k-20251114_120221 Image-Text-to-Text • 4B • Updated 1 day ago • 41
Why Far Looks Up: Probing Spatial Representation in Vision-Language Models Paper • 2605.30161 • Published 4 days ago • 38
Why Far Looks Up — Data-Scale Fine-tuned Checkpoints Collection • Code: https://github.com/cheolhong0916/contrastive-probing • 8 items • Updated 3 days ago