허주원's picture

5 9

허주원

jacksonhernande

AI & ML interests

None yet

Recent Activity

liked a dataset 7 days ago

Kgshop/fullmeta

upvoted a paper 8 days ago

DelTA: Discriminative Token Credit Assignment for Reinforcement Learning from Verifiable Rewards

liked a model 9 days ago

theSOL1/cas4133-assn2-dpo

View all activity

Organizations

None yet

upvoted a paper 8 days ago

DelTA: Discriminative Token Credit Assignment for Reinforcement Learning from Verifiable Rewards

Paper • 2605.21467 • Published 11 days ago • 204

upvoted a paper 10 days ago

Anti-Self-Distillation for Reasoning RL via Pointwise Mutual Information

Paper • 2605.11609 • Published 19 days ago • 195

upvoted a paper 16 days ago

HAGE: Harnessing Agentic Memory via RL-Driven Weighted Graph Evolution

Paper • 2605.09942 • Published 20 days ago • 15

upvoted a paper 29 days ago

Heterogeneous Scientific Foundation Model Collaboration

Paper • 2604.27351 • Published about 1 month ago • 217

upvoted a paper about 2 months ago

MegaTrain: Full Precision Training of 100B+ Parameter Large Language Models on a Single GPU

Paper • 2604.05091 • Published Apr 6 • 47