arxiv:2507.06272
Linger Deng
dle666
AI & ML interests
OCR, CV
Recent Activity
upvoted a paper about 7 hours ago
Video Streaming Thinking: VideoLLMs Can Watch and Think Simultaneously
authored a paper about 1 month ago
R-CoT: Reverse Chain-of-Thought Problem Generation for Geometric Reasoning in Large Multimodal Models
authored a paper about 1 month ago
LIRA: Inferring Segmentation in Large Multi-modal Models with Local Interleaved Region Assistance