Official model collection for the paper "TokenPacker: Efficient Visual Projector for Multimodal LLM"
LI WENTONG
sunshine-lwt
AI & ML interests
Computer Vision, Multimodal AI
Recent Activity
upvoted a paper 6 days ago
InstructSAM: Segment Any Instance with Any Instructions
liked a dataset 2 months ago
allenxinn/AgentVLN-Instruct