arxiv:2411.19477
Xuchen Pan
panxuchen
AI & ML interests
None yet
Recent Activity
new activity 4 days ago
agentscope-ai/QwenPaw-Flash-2B: Update README.md
upvoted a paper 2 months ago
On the Entropy Dynamics in Reinforcement Fine-Tuning of Large Language Models