Andy Xu's picture

3

Andy Xu

andaero

·

andaero

AI & ML interests

Computational Materials Generation | AI4Science | Reinforcement Learning

Recent Activity

authored a paper about 1 month ago

PLaID++: A Preference Aligned Language Model for Targeted Inorganic Materials Design

updated a model 4 months ago

HOPE-Lab-HMC/PLaID

upvoted an article 10 months ago

DeepSeek-R1 Dissection: Understanding PPO & GRPO Without Any Prior Reinforcement Learning Knowledge

View all activity

Organizations

authored a paper about 1 month ago

PLaID++: A Preference Aligned Language Model for Targeted Inorganic Materials Design

Paper • 2509.07150 • Published Sep 8, 2025

updated a model 4 months ago

HOPE-Lab-HMC/PLaID

Updated Oct 17, 2025

upvoted an article 10 months ago

Article

DeepSeek-R1 Dissection: Understanding PPO & GRPO Without Any Prior Reinforcement Learning Knowledge

Feb 7, 2025

•

274

upvoted a paper 11 months ago

DAPO: An Open-Source LLM Reinforcement Learning System at Scale

Paper • 2503.14476 • Published Mar 18, 2025 • 144

upvoted a paper over 1 year ago

RLHF Workflow: From Reward Modeling to Online RLHF

Paper • 2405.07863 • Published May 13, 2024 • 71