Kien Nguyen
kiennt
·
AI & ML interests
None yet
Organizations
Vision Language Model
-
LLaVA-Interactive: An All-in-One Demo for Image Chat, Segmentation, Generation and Editing
Paper • 2311.00571 • Published • 43 -
Grounding Visual Illusions in Language: Do Vision-Language Models Perceive Illusions Like Humans?
Paper • 2311.00047 • Published • 10 -
From CLIP to DINO: Visual Encoders Shout in Multi-modal Large Language Models
Paper • 2310.08825 • Published • 1 -
On the Road with GPT-4V(ision): Early Explorations of Visual-Language Model on Autonomous Driving
Paper • 2311.05332 • Published • 11
Code LLM
Vision Language Model
-
LLaVA-Interactive: An All-in-One Demo for Image Chat, Segmentation, Generation and Editing
Paper • 2311.00571 • Published • 43 -
Grounding Visual Illusions in Language: Do Vision-Language Models Perceive Illusions Like Humans?
Paper • 2311.00047 • Published • 10 -
From CLIP to DINO: Visual Encoders Shout in Multi-modal Large Language Models
Paper • 2310.08825 • Published • 1 -
On the Road with GPT-4V(ision): Early Explorations of Visual-Language Model on Autonomous Driving
Paper • 2311.05332 • Published • 11
models
0
None public yet
datasets
0
None public yet