arxiv:2601.09708
Min-Hung Chen
AI & ML interests
Multimodal AI, Transfer Learning, Unsupervised Learning, Video Understanding, Vision Transformer, Computer Vision, Deep Learning
Recent Activity
new activity 2 days ago
nvidia/4D-RGPT-8B:fix links liked a model 2 days ago
nvidia/4D-RGPT-8B upvoted a paper 5 days ago
Why Far Looks Up: Probing Spatial Representation in Vision-Language Models