ZHANG HAO
26hzhang
12 followers · 25 following
https://26hzhang.github.io/
hzhang26
26hzhang
hzhang26
AI & ML interests
None yet
Recent Activity
upvoted a paper (1 day ago): CodeScaler: Scaling Code LLM Training and Test-Time Inference via Execution-Free Reward Models
upvoted a paper (9 days ago): Secure Code Generation via Online Reinforcement Learning with Vulnerability Reward Model
upvoted a paper (13 days ago): Improving Data and Reward Design for Scientific Reasoning in Large Language Models
Organizations
Papers
12
arxiv:2510.11693
arxiv:2509.21268
arxiv:2509.17437
arxiv:2507.22607
models
0
None public yet
datasets
6
26hzhang/math_dapo_qwen2.5-math-7b_rollout_n_10 • Viewer • Updated Nov 15, 2025 • 17.4k • 6
26hzhang/math_7.5k_qwen2.5-math-7b_rollout_n_10 • Viewer • Updated Nov 15, 2025 • 7.5k • 7
26hzhang/math_dapo_qwen3-1.7b_rollout_n_10 • Viewer • Updated Nov 15, 2025 • 17.4k • 17
26hzhang/math_7.5k_qwen3-1.7b_rollout_n_10 • Viewer • Updated Nov 14, 2025 • 7.5k • 7
26hzhang/math_dapo_qwen3-4b_rollout_n_10 • Viewer • Updated Nov 11, 2025 • 17.4k • 32
26hzhang/math_7.5k_qwen3-4b_rollout_n_10 • Viewer • Updated Nov 7, 2025 • 7.5k • 13