Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
s
august66
Follow
Kyleyee's profile picture
callmespring's profile picture
mamba413's profile picture
3 followers
·
2 following
AI & ML interests
None yet
Recent Activity
updated
a model
4 days ago
august66/hh_qwen1.5_IS_KL
updated
a dataset
4 days ago
august66/hh_helpfulness_qwen2.5_1.5b_generation
updated
a dataset
5 days ago
august66/hh_helpfulness_mc_rewards
View all activity
Organizations
august66
's models
13
Sort: Recently updated
august66/hh_qwen1.5_IS_KL
2B
•
Updated
4 days ago
•
24
august66/hh_qwen1.5_IS_CLIP_round_2
2B
•
Updated
6 days ago
•
17
august66/hh_qwen1.5_drpo_fixed_beta
2B
•
Updated
9 days ago
•
31
august66/hh_qwen1.5_IS_CLIP
2B
•
Updated
9 days ago
•
34
august66/hh_qwen1.5_drpo_adaptive_beta
Updated
10 days ago
august66/hh_qwen1.5_is_clip_1000_5e6
2B
•
Updated
10 days ago
•
22
august66/hh_qwen_1.5b_sft_dpo_model
2B
•
Updated
11 days ago
•
76
august66/hh_qwen1.5_drpo_target_3.0_1000_checkpoint
2B
•
Updated
12 days ago
•
12
august66/qwen2.5-1.5b-base-hh-helpful-sft
Text Generation
•
2B
•
Updated
17 days ago
•
352
august66/Qwen2.5-1.5B-Instruct-reward-hh-helpful
Text Classification
•
2B
•
Updated
17 days ago
•
14
august66/ultrafeedback_qwen_1.5b_drpo_model
Updated
Jul 9, 2025
august66/qwen2-sft-dpo-imdb-beta-1.0
Updated
Jun 2, 2025
august66/qwen2-sft-final
Text Generation
•
0.5B
•
Updated
Jun 1, 2025