Shudong Zhang
shudong
ยท
AI & ML interests
None yet
Recent Activity
upvoted an article 19 days ago
From GRPO to DAPO and GSPO: What, Why, and How upvoted an article 3 months ago
Navigating the RLHF Landscape: From Policy Gradients to PPO, GAE, and DPO for LLM Alignment Organizations
None yet