14 21

Zhao Zihao

xishze

AI & ML interests

None yet

Recent Activity

upvoted a paper 3 days ago

The Flip Side of RLHF: On-Policy Feedback for Reward Model Self-Supervised Improvement

upvoted a paper 4 days ago

Towards Verifiable Multimodal Deep Research: A Multi-Agent Harness for Interleaved Report Generation

upvoted a paper 5 days ago

SwanVoice: Expressive Long-Form Zero-Shot Speech Synthesis for Both Monologue and Dialogue

View all activity

Organizations

None yet

upvoted a paper 3 days ago

The Flip Side of RLHF: On-Policy Feedback for Reward Model Self-Supervised Improvement

Paper • 2605.30888 • Published 10 days ago • 10

upvoted a paper 4 days ago

Towards Verifiable Multimodal Deep Research: A Multi-Agent Harness for Interleaved Report Generation

Paper • 2605.29861 • Published 11 days ago • 16

upvoted a paper 5 days ago

SwanVoice: Expressive Long-Form Zero-Shot Speech Synthesis for Both Monologue and Dialogue

Paper • 2605.30993 • Published 10 days ago • 56

liked a dataset 6 days ago

wikimedia/wikipedia

Viewer • Updated Jan 9, 2024 • 61.6M • 212k • 1.24k

liked a model 7 days ago

deepseek-ai/DeepSeek-OCR

Image-Text-to-Text • 3B • Updated Nov 4, 2025 • 2.7M • 3.27k

upvoted a paper 9 days ago

Gamma-World: Generative Multi-Agent World Modeling Beyond Two Players

Paper • 2605.28816 • Published 12 days ago • 420

liked a dataset 10 days ago

world-igr-plum/regions

Updated Jun 17, 2025 • 387k • 23

liked a model 14 days ago

tencent/Hy-MT2-1.8B

Translation • 2B • Updated 12 days ago • 22.9k • • 1.1k

liked a dataset 15 days ago

HuggingFaceFW/finephrase

Viewer • Updated Mar 31 • 1.02B • 478k • 124

liked a model 16 days ago

tencent/Hy-MT2-30B-A3B

Translation • 30B • Updated 12 days ago • 6.25k • 450

upvoted a paper 16 days ago

Anti-Self-Distillation for Reasoning RL via Pointwise Mutual Information

Paper • 2605.11609 • Published 27 days ago • 195

liked a model 16 days ago

seraphimzzzz/824092

Updated 16 days ago • 1

upvoted 2 papers 17 days ago

AutoResearchClaw: Self-Reinforcing Autonomous Research with Human-AI Collaboration

Paper • 2605.20025 • Published 20 days ago • 186

DiffusionOPD: A Unified Perspective of On-Policy Distillation in Diffusion Models

Paper • 2605.15055 • Published 25 days ago • 19

liked a model 20 days ago

nataliaaolmo/distilhubert-urbansound8k-finetuned1

23.7M • Updated 20 days ago • 24 • 1

liked a dataset 27 days ago

maifoundations/VideoOdyssey

Viewer • Updated 11 days ago • 100 • 1.1k • 7

liked 2 models about 1 month ago

rafathasan/temp

Updated about 1 hour ago • 1

Theogott/spr-qwen3_5-9b-dora-vramsafe-gguf

Text Generation • 9B • Updated May 1 • 62 • 1

liked a dataset about 2 months ago

wegrthj/yzbw0u-akrw-raw

Preview • Updated Apr 28 • 2.68k • 1

upvoted a paper about 2 months ago

WildDet3D: Scaling Promptable 3D Detection in the Wild

Paper • 2604.08626 • Published Apr 9 • 247

Zhao Zihao

AI & ML interests

Recent Activity

Organizations

xishze's activity