LMMs-Lab

community

https://www.lmms-lab.com/

EvolvingLMMs-Lab

AI & ML interests

Feeling and building the multimodal intelligence.

Recent Activity

yl-1993 authored a paper 3 days ago

Apple-$π$: Benchmarking Thinking with Video Towards Law-Grounded Physical Intelligence

pufanyi authored a paper 3 days ago

Mage-Flow: An Efficient Native-Resolution Foundation Model for Image Generation and Editing

THUdyh authored a paper 4 days ago

VideoChat3: Fully Open Video MLLM for Efficient and Generalist Video Understanding

View all activity

Papers

SkillOpt-Lite: Better and Faster Agent Self-evolution via One Line of Vibe

ParaVT: Taming the Tool Prior Paradox for Parallel Tool Use in Agentic Video Reinforcement Learning

View all Papers

lmms-lab 's Spaces 7

README

LiveBench

LLaVA OneVision 1.5

Interact with a multimodal chatbot using text and images

Aero 1 Audio Demo

Demo for Aero-1-Audio

Multimodal SAE

Demo for Multimodal-SAE

LLaVA-NeXT-Interleave-Demo

LongVA Demo