lyj
hardworkinglyj
AI & ML interests
None yet
Recent Activity
upvoted a paper about 12 hours ago
OPID: On-Policy Skill Distillation for Agentic Reinforcement Learning
upvoted a paper about 1 year ago
Thought-Augmented Policy Optimization: Bridging External Guidance and Internal Capabilities
upvoted a paper over 1 year ago
Boosting Multimodal Reasoning with MCTS-Automated Structured Thinking
Organizations
None yet