P Lin
pufflin
AI & ML interests
None yet
Recent Activity
upvoted a paper 11 days ago
Guidance Contrastive Token Credit Assignment for Discrete Policy Optimization upvoted a paper 11 days ago
Harness-1: Reinforcement Learning for Search Agents with State-Externalizing Harnesses liked a model 11 days ago
pat-jj/harness-1Organizations
None yet