BadCat
Foresta
AI & ML interests
LLMs
Deep learning
Reinforcement learning
Recent Activity
upvoted a paper 14 days ago
On the Geometry of On-Policy Distillation
upvoted a paper about 2 months ago
A^2TGPO: Agentic Turn-Group Policy Optimization with Adaptive Turn-level Clipping
liked a Space 3 months ago
duoan/TorchCode
Organizations
None yet