6 4 5

Zhizhuo Yang

george614

AI & ML interests

LLM, LMM, GenAI, RL, Robotics

Recent Activity

updated a dataset 13 days ago

Writer/p1-AA-rollout-human-filtered

new activity 13 days ago

Writer/p1-AA-rollout-human-filtered:Upload aa_human_filtered_clean_clean.jsonl with huggingface_hub

published a dataset 13 days ago

Writer/p1-AA-rollout-human-filtered

View all activity

Organizations

updated a dataset 13 days ago

Writer/p1-AA-rollout-human-filtered

Viewer • Updated 13 days ago • 484 • 14

New activity in Writer/p1-AA-rollout-human-filtered 13 days ago

Upload aa_human_filtered_clean_clean.jsonl with huggingface_hub

#1 opened 13 days ago by

george614

published a dataset 13 days ago

Writer/p1-AA-rollout-human-filtered

Viewer • Updated 13 days ago • 484 • 14

liked a dataset about 2 months ago

hkust-nlp/Toolathlon-Trajectories

Preview • Updated Dec 5, 2025 • 1.58k • 18

liked a Space 3 months ago

The Ultra-Scale Playbook

🌌

3.63k

The ultimate guide to training LLM on large GPU Clusters

liked a dataset 4 months ago

nvidia/HelpSteer2

Viewer • Updated Dec 18, 2024 • 21.4k • 12k • 435

upvoted an article 6 months ago

Article

Finally, a Replacement for BERT: Introducing ModernBERT

Dec 19, 2024

•

718

upvoted a collection 6 months ago

RL+reason model

Collection

258 items • Updated 10 days ago • 22

liked 2 datasets 7 months ago

galileo-ai/agent-leaderboard

Viewer • Updated Jul 16, 2025 • 1.28k • 239 • 33

hypervariance/function-calling-sharegpt

Viewer • Updated Dec 8, 2023 • 86.9k • 169 • 38

upvoted a paper 7 months ago

Reflect, Retry, Reward: Self-Improving LLMs via Reinforcement Learning

Paper • 2505.24726 • Published May 30, 2025 • 277

upvoted a paper 11 months ago

Expect the Unexpected: FailSafe Long Context QA for Finance

Paper • 2502.06329 • Published Feb 10, 2025 • 133

Zhizhuo Yang

AI & ML interests

Recent Activity

Organizations

george614's activity

Upload aa_human_filtered_clean_clean.jsonl with huggingface_hub

The Ultra-Scale Playbook

Finally, a Replacement for BERT: Introducing ModernBERT