Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Buckets new
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
WendiLi's picture
1 5

WendiLi

Windy0822
HaloMaster's profile picture junwux's profile picture dark-pen's profile picture
·

AI & ML interests

None yet

Organizations

None yet

Windy0822 's collections 2

GEB
General Exploratory Bonus for optimistic exploration in RLHF
  • Windy0822/LLaMA3_geb_p_kl

    8B • Updated Sep 30, 2025 • 1
  • Windy0822/LLaMA3_geb_f_kl

    8B • Updated Sep 30, 2025 • 2
  • Windy0822/LLaMA3_geb_tanh_kl

    8B • Updated Sep 30, 2025 • 1
  • Windy0822/LLaMA3_geb_p_hel

    8B • Updated Sep 30, 2025 • 5
ImplicitPRM
  • Windy0822/ImplicitPRM_DPO

    Text Generation • 8B • Updated Dec 4, 2024 • 3 • 2
  • Windy0822/ImplicitPRM_CE

    Text Generation • 8B • Updated Dec 8, 2024 • 7 • 2
  • Windy0822/ultrainteract_math_rollout

    Viewer • Updated Dec 5, 2024 • 32.9k • 12 • 9
GEB
General Exploratory Bonus for optimistic exploration in RLHF
  • Windy0822/LLaMA3_geb_p_kl

    8B • Updated Sep 30, 2025 • 1
  • Windy0822/LLaMA3_geb_f_kl

    8B • Updated Sep 30, 2025 • 2
  • Windy0822/LLaMA3_geb_tanh_kl

    8B • Updated Sep 30, 2025 • 1
  • Windy0822/LLaMA3_geb_p_hel

    8B • Updated Sep 30, 2025 • 5
ImplicitPRM
  • Windy0822/ImplicitPRM_DPO

    Text Generation • 8B • Updated Dec 4, 2024 • 3 • 2
  • Windy0822/ImplicitPRM_CE

    Text Generation • 8B • Updated Dec 8, 2024 • 7 • 2
  • Windy0822/ultrainteract_math_rollout

    Viewer • Updated Dec 5, 2024 • 32.9k • 12 • 9
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs