-
Describe What You See with Multimodal Large Language Models to Enhance Video Recommendations
Paper • 2508.09789 • Published • 5 -
MM-BrowseComp: A Comprehensive Benchmark for Multimodal Browsing Agents
Paper • 2508.13186 • Published • 19 -
ZARA: Zero-shot Motion Time-Series Analysis via Knowledge and Retrieval Driven LLM Agents
Paper • 2508.04038 • Published • 1 -
Prompt Orchestration Markup Language
Paper • 2508.13948 • Published • 48
Saini
shankars
AI & ML interests
None yet
Recent Activity
updated
a collection
4 days ago
AI-paper
updated
a collection
4 days ago
AI-paper
upvoted
a
paper
about 1 month ago
Youtu-Agent: Scaling Agent Productivity with Automated Generation and Hybrid Policy Optimization
Organizations
None yet