Richard Zhuang PRO

RZ412

·

https://richardzhuang0412.github.io

AI & ML interests

LLM Routing, LLM + Games, Post-Training, Agents

Recent Activity

updated a dataset 23 days ago

DCAgent2/dev_set_v2_Nemotron_Terminal_8B_20260701_041825

published a dataset 23 days ago

DCAgent2/dev_set_v2_Nemotron_Terminal_8B_20260701_041825

authored a paper 29 days ago

OpenThoughts-Agent: Data Recipes for Agentic Models

View all activity

Organizations

Papers 3

arxiv:2606.24855

arxiv:2501.08328

arxiv:2410.02223

models 57

RZ412/Qwen2.5-3B-Instruct-inferredbugs-sandboxes-traces-terminus-2

Updated Dec 4, 2025

RZ412/Qwen2.5-3B-Instruct-OT3-8K-QwQ-Min-R1-Min-MLR

Text Generation • 3B • Updated Nov 30, 2025 • 6

RZ412/Qwen2.5-3B-Instruct-OT3-8K-R1-Only-Seed-42

Text Generation • 3B • Updated Nov 3, 2025 • 2

RZ412/Qwen2.5-3B-Instruct-OT3-8K-QwQ-R1-RM-50-50-SS-42-AS-42

Text Generation • 3B • Updated Nov 3, 2025 • 6

RZ412/Qwen2.5-3B-Instruct-OT3-8K-QwQ-Only-Seed-42

Text Generation • 3B • Updated Nov 3, 2025 • 4

RZ412/Qwen2.5-3B-Instruct-OT3-8K-R1-MeL

Text Generation • 3B • Updated Oct 28, 2025 • 1

RZ412/Qwen2.5-3B-Instruct-OT3-8K-R1-ML

Text Generation • 3B • Updated Oct 27, 2025 • 8

RZ412/Qwen2.5-3B-Instruct-OT3-8K-QwQ-MaL-misstore

Text Generation • 3B • Updated Oct 27, 2025 • 10

RZ412/Qwen2.5-3B-Instruct-OT3-8K-QwQ-R1-DB

Text Generation • 3B • Updated Oct 26, 2025 • 3

RZ412/Qwen2.5-3B-Instruct-OT3-8K-QwQ-R1-RES

Text Generation • 3B • Updated Oct 26, 2025 • 5

datasets 54

RZ412/opencode_sft_traces

Viewer • Updated Apr 13 • 17 • 21

RZ412/test_harbor_trace

Viewer • Updated Apr 3 • 97 • 15

RZ412/test_harbor_trace-summarization-9-summary

Viewer • Updated Apr 3 • 1 • 11

RZ412/test_harbor_trace-summarization-9-questions

Viewer • Updated Apr 3 • 1 • 18

RZ412/test_harbor_trace-summarization-8-summary

Viewer • Updated Apr 3 • 1 • 19

RZ412/test_harbor_trace-summarization-8-questions

Viewer • Updated Apr 3 • 1 • 15

RZ412/test_harbor_trace-summarization-7-summary

Viewer • Updated Apr 3 • 1 • 33

RZ412/test_harbor_trace-summarization-7-questions

Viewer • Updated Apr 3 • 1 • 11

RZ412/test_harbor_trace-summarization-6-summary

Viewer • Updated Apr 3 • 1 • 13

RZ412/test_harbor_trace-summarization-6-questions

Viewer • Updated Apr 3 • 1 • 14

View 54 datasets