Submitted by
Soujanya Poria
AI & ML interests
Natural Language Processing
Recent Activity
View all activity
Papers
GRAIL: Gradient-Reweighted Advantages for Reinforcement Learning with Verifiable Rewards
Error-Free Linear Attention is a Free Lunch: Exact Solution from Continuous-Time Dynamics