Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Log In
Sign Up
Jason Wei
JWei05
Follow
0 followers
·
1 following
AI & ML interests
RL, LLMs, DL Theory
Recent Activity
updated
a model
about 17 hours ago
JWei05/gemma3-12b-pt-off-policy-distilled-from-27bptw20-step80
published
a model
about 17 hours ago
JWei05/gemma3-12b-pt-off-policy-distilled-from-27bptw20-step80
updated
a model
about 17 hours ago
JWei05/gemma3-4b-pt-off-policy-distilled-from-27bptw20-step80
View all activity
Organizations
models
13
Sort: Recently updated
JWei05/gemma3-12b-pt-off-policy-distilled-from-27bptw20-step80
Updated
about 11 hours ago
JWei05/gemma3-4b-pt-off-policy-distilled-from-27bptw20-step80
Updated
about 16 hours ago
JWei05/dapo-gemma3-27b-pt-warmup20
Updated
about 17 hours ago
JWei05/dapo-gemma3-27b-it-warmup20
Updated
2 days ago
JWei05/gemma3-12b-it-off-policy-distilled-from-gemma4-31b
Updated
2 days ago
JWei05/gemma3-4b-it-off-policy-distilled-from-gemma4-31b
Updated
2 days ago
JWei05/gemma3-12b-pt-off-policy-distilled-from-dapo27b
Updated
3 days ago
JWei05/gemma3-4b-pt-off-policy-distilled-from-dapo27b
Updated
3 days ago
JWei05/gemma3-12b-it-off-policy-distilled-from-dapo27b-correct
Updated
3 days ago
JWei05/gemma3-4b-it-off-policy-distilled-from-dapo27b-correct
Updated
3 days ago
View 13 models
datasets
38
Sort: Recently updated
JWei05/DAPO-Gemma3-27B-PT-warmup20-step80-SFT-Data
Viewer
•
Updated
about 17 hours ago
•
34.8k
•
6
JWei05/DAPO-Gemma4-31B-IT-SFT-Data
Viewer
•
Updated
2 days ago
•
34.8k
•
14
JWei05/DAPO-Gemma3-27B-IT-RL-SFT-Data-correct
Viewer
•
Updated
3 days ago
•
41.8k
•
21
JWei05/DAPO-Gemma3-27B-IT-RL-SFT-Data
Viewer
•
Updated
3 days ago
•
69.6k
•
22
JWei05/swe_smith_py_qwen3.5_35b_trajs_1952
Viewer
•
Updated
7 days ago
•
2k
•
49
JWei05/swe_smith_rs_qwen3.5_35b_trajs_2477
Viewer
•
Updated
7 days ago
•
5k
•
41
JWei05/swe_smith_go_qwen3.5_35b_trajs_1448
Viewer
•
Updated
7 days ago
•
1.63k
•
38
JWei05/swe_smith_js_qwen3.5_35b_trajs_4358
Viewer
•
Updated
7 days ago
•
5k
•
43
JWei05/swe_smith_java_qwen3.5_35b_trajs_4369
Viewer
•
Updated
7 days ago
•
5k
•
52
JWei05/swe_smith_js_5902_filtered
Viewer
•
Updated
18 days ago
•
5.9k
•
31
View 38 datasets