arxiv:2512.24618
Xiaoyu Tan
WIlliam1900
AI & ML interests
None yet
Recent Activity
liked
a model
1 day ago
moonshotai/Kimi-K2.5
authored
a paper
23 days ago
Learn the Ropes, Then Trust the Wins: Self-imitation with Progressive
Exploration for Agentic Reinforcement Learning