arxiv:2510.00492
Jiongdao Jin
jiongdao
AI & ML interests
None yet
Recent Activity
upvoted a paper 5 days ago
Zone of Proximal Policy Optimization: Teacher in Prompts, Not Gradients updated a model 9 days ago
jiongdao/grpo-outputs updated a dataset 9 days ago
jiongdao/grpo-results