NITP: Next Implicit Token Prediction for LLM Pre-training Paper • 2605.24956 • Published 19 days ago • 35
Rethinking State Tracking in Recurrent Models Through Error Control Dynamics Paper • 2605.07755 • Published May 8 • 24