Thinking in Frames: How Visual Context and Test-Time Scaling Empower Video Reasoning Paper • 2601.21037 • Published 10 days ago • 13
How Well Do Models Follow Visual Instructions? VIBE: A Systematic Benchmark for Visual Instruction-Driven Image Editing Paper • 2602.01851 • Published 5 days ago • 16
Latent Sketchpad: Sketching Visual Thoughts to Elicit Multimodal Reasoning in MLLMs Paper • 2510.24514 • Published Oct 28, 2025 • 22
Lost in Embeddings: Information Loss in Vision-Language Models Paper • 2509.11986 • Published Sep 15, 2025 • 29
Imagine while Reasoning in Space: Multimodal Visualization-of-Thought Paper • 2501.07542 • Published Jan 13, 2025 • 3
Generating Data for Symbolic Language with Large Language Models Paper • 2305.13917 • Published May 23, 2023
TopViewRS: Vision-Language Models as Top-View Spatial Reasoners Paper • 2406.02537 • Published Jun 4, 2024
UnifiedSKG: Unifying and Multi-Tasking Structured Knowledge Grounding with Text-to-Text Language Models Paper • 2201.05966 • Published Jan 16, 2022 • 1
On Task Performance and Model Calibration with Supervised and Self-Ensembled In-Context Learning Paper • 2312.13772 • Published Dec 21, 2023