16 17 10

Boyuan Sun

BBBBCHAN

https://bbbbchan.github.io/

BBBBCHAN

AI & ML interests

None yet

Recent Activity

submitted a paper 5 days ago

See What I Mean: Aligning Vision and Language Representations for Video Fine-grained Object Understanding

liked a model 9 days ago

BBBBCHAN/SWIM-7B

updated a model 10 days ago

BBBBCHAN/SWIM-7B

View all activity

Organizations

submitted a paper to Daily Papers 5 days ago

See What I Mean: Aligning Vision and Language Representations for Video Fine-grained Object Understanding

Paper • 2605.18018 • Published 12 days ago • 32

liked a model 9 days ago

BBBBCHAN/SWIM-7B

Video-Text-to-Text • 8B • Updated 10 days ago • 39 • 2

updated a model 10 days ago

BBBBCHAN/SWIM-7B

Video-Text-to-Text • 8B • Updated 10 days ago • 39 • 2

liked a dataset 10 days ago

BBBBCHAN/NL-Refer

Preview • Updated 10 days ago • 260 • 2

published a dataset 10 days ago

BBBBCHAN/NL-Refer

Preview • Updated 10 days ago • 260 • 2

updated a dataset 10 days ago

BBBBCHAN/NL-Refer

Preview • Updated 10 days ago • 260 • 2

published a model 10 days ago

BBBBCHAN/SWIM-7B

Video-Text-to-Text • 8B • Updated 10 days ago • 39 • 2

authored 2 papers 11 days ago

GeoAgent: Learning to Geolocate Everywhere with Reinforced Geographic Characteristics

Paper • 2602.12617 • Published Feb 13 • 20

See What I Mean: Aligning Vision and Language Representations for Video Fine-grained Object Understanding

Paper • 2605.18018 • Published 12 days ago • 32

upvoted a paper 11 days ago

See What I Mean: Aligning Vision and Language Representations for Video Fine-grained Object Understanding

Paper • 2605.18018 • Published 12 days ago • 32

upvoted a paper about 1 month ago

Mutual Forcing: Dual-Mode Self-Evolution for Fast Autoregressive Audio-Video Character Generation

Paper • 2604.25819 • Published Apr 28 • 17

liked a model about 2 months ago

ghost233lism/GeoAgent

Image-Text-to-Text • 8B • Updated Apr 8 • 104 • 3

liked a dataset 3 months ago

AudioVisual-Caption/ASID-1M

Viewer • Updated Mar 11 • 241k • 732 • 83

upvoted a paper 3 months ago

GeoAgent: Learning to Geolocate Everywhere with Reinforced Geographic Characteristics

Paper • 2602.12617 • Published Feb 13 • 20

liked a dataset 3 months ago

ghost233lism/GeoSeek

Viewer • Updated Feb 21 • 33k • 889 • 1

commented a paper 10 months ago

A Glimpse to Compress: Dynamic Visual Token Pruning for Large Vision-Language Models

Paper • 2508.01548 • Published Aug 3, 2025 • 14 •

upvoted 3 papers 10 months ago

A Glimpse to Compress: Dynamic Visual Token Pruning for Large Vision-Language Models

Paper • 2508.01548 • Published Aug 3, 2025 • 14

Towards RAW Object Detection in Diverse Conditions

Paper • 2411.15678 • Published Nov 24, 2024 • 1

Facial Dynamics in Video: Instruction Tuning for Improved Facial Expression Perception and Contextual Awareness

Paper • 2501.07978 • Published Jan 14, 2025 • 1

liked a Space 10 months ago

DepthAnything AC

🌖

This is official demo of Depth Anything At Any Condition

Boyuan Sun

AI & ML interests

Recent Activity

Organizations

BBBBCHAN's activity

DepthAnything AC