Lei Wang
demolei
AI & ML interests
LLMs
Recent Activity
upvoted a paper 2 days ago
Self-Distilled Agentic Reinforcement Learning
upvoted a paper 4 days ago
RubricEM: Meta-RL with Rubric-guided Policy Decomposition beyond Verifiable Rewards