Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
3
9
2
Taeho Hwang
doubleyyh
Follow
ddindidu's profile picture
kreamsoup's profile picture
juyoungml's profile picture
10 followers
ยท
45 following
ThisIsHwang
AI & ML interests
None yet
Recent Activity
new
activity
7 days ago
Qwen/Qwen3.5-27B:
Value error, Model architectures ['Qwen3_5ForConditionalGeneration'] are not supported for now. Transformers version 5.3.0.dev0
reacted
to
sergiopaniego
's
post
with ๐
19 days ago
TRL v0.27.0 is out!! ๐ฅณ It includes GDPO, the latest variant of GRPO for multi-reward RL โจ GDPO decouples reward normalization to avoid reward collapse and improve per-reward convergence โ developed by @sliuau @SimonX et al. Explore the paper: https://huggingface.co/papers/2601.05242 Explore the full set of changes here: https://github.com/huggingface/trl/releases/tag/v0.27.0
liked
a Space
30 days ago
SamsungResearch/TRUEBench
View all activity
Organizations
doubleyyh
's models
4
Sort:ย Recently updated
doubleyyh/email-tuned-qwen2-lora
Text Generation
โข
Updated
Dec 26, 2024
โข
4
doubleyyh/mixed-bge-m3-email
Sentence Similarity
โข
0.6B
โข
Updated
Dec 25, 2024
doubleyyh/exit-gemma-2b
Updated
Dec 21, 2024
โข
1
doubleyyh/exit-gemma-7b
Updated
Dec 21, 2024