sherry

rain305

30 8

AI & ML interests

None yet

Recent Activity

upvoted a paper about 20 hours ago

Hide-and-Seek in Trajectories: Discovering Failure Signals for VLA Runtime Monitoring

upvoted a paper about 20 hours ago

Neglected Free Lunch from Post-training: Progress Advantage for LLM Agents

upvoted a paper 3 days ago

Qwen-Image-Agent: Bridging the Context Gap in Real-World Image Generation

Organizations

None yet

upvoted 2 papers about 20 hours ago

Hide-and-Seek in Trajectories: Discovering Failure Signals for VLA Runtime Monitoring

Paper • 2605.30834 • Published May 29 • 10

Neglected Free Lunch from Post-training: Progress Advantage for LLM Agents

Paper • 2606.26080 • Published 7 days ago • 10

upvoted a paper 3 days ago

Qwen-Image-Agent: Bridging the Context Gap in Real-World Image Generation

Paper • 2606.26907 • Published 6 days ago • 47

upvoted a paper 8 days ago

DF3DV-1K: A Large-Scale Dataset and Benchmark for Distractor-Free Novel View Synthesis

Paper • 2604.13416 • Published 13 days ago • 32

upvoted a paper 17 days ago

Kwai Keye-VL-2.0 Technical Report

Paper • 2606.10651 • Published 22 days ago • 192

upvoted 4 papers 20 days ago

Vision-OPD: Learning to See Fine Details for Multimodal LLMs via On-Policy Self-Distillation

Paper • 2605.18740 • Published May 18 • 5

Retrospective Harness Optimization: Improving LLM Agents via Self-Preference over Trajectory Rollouts

Paper • 2606.05922 • Published 26 days ago • 69

Flow-DPPO: Divergence Proximal Policy Optimization for Flow Matching Models

Paper • 2606.11025 • Published 22 days ago • 41

SCAIL-2: Unifying Controlled Character Animation with End-to-end In-Context Conditioning

Paper • 2606.10804 • Published 22 days ago • 51

upvoted a paper 2 months ago

RationalRewards: Reasoning Rewards Scale Visual Generation Both Training and Test Time

Paper • 2604.11626 • Published Apr 13 • 103

liked a model 3 months ago

Skywork/Skywork-UniPic-1.5B

Any-to-Any • Updated Sep 8, 2025 • 77 • 116

upvoted a paper 3 months ago

Rethinking On-Policy Distillation of Large Language Models: Phenomenology, Mechanism, and Recipe

Paper • 2604.13016 • Published Apr 14 • 113

liked a model 3 months ago

jdopensource/JoyAI-Image-Edit

Image-to-Image • Updated May 7 • 183 • 130

upvoted a paper 3 months ago

ShotStream: Streaming Multi-Shot Video Generation for Interactive Storytelling

Paper • 2603.25746 • Published Mar 26 • 155

liked a model 3 months ago

CodeGoat24/UniGenBench-EvalModel-qwen-72b-v1

Image-Text-to-Text • 73B • Updated Oct 25, 2025 • 77 • 4

upvoted a paper 4 months ago

Evaluating and Steering Modality Preferences in Multimodal Large Language Model

Paper • 2505.20977 • Published May 27, 2025 • 10

upvoted an article 4 months ago

NEO-unify: Building Native Multimodal Unified Models End to End

Article • sensenova • Mar 5 • 167

upvoted 2 papers 4 months ago

Heterogeneous Agent Collaborative Reinforcement Learning

Paper • 2603.02604 • Published Mar 3 • 198

CoCo: Code as CoT for Text-to-Image Preview and Rare Concept Generation

Paper • 2603.08652 • Published Mar 9 • 41

liked a dataset 4 months ago

LanguageBind/UniWorld-V1

Viewer • Updated Jun 16, 2025 • 7.11k • 1.23k • 26
