xiaomi-research/MiLMMT-46-12B-v0.1
Translation • 12B • Updated • 112 • 2
None defined yet.
Video Streaming Thinking: VideoLLMs Can Watch and Think Simultaneously
Xiaomi-Robotics-0: An Open-Sourced Vision-Language-Action Model with Real-Time Execution