taicheng guo
taicheng
AI & ML interests
None yet
Recent Activity
upvoted a paper 1 day ago: Group-in-Group Policy Optimization for LLM Agent Training
upvoted a paper about 1 month ago: Can LLMs Guide Their Own Exploration? Gradient-Guided Reinforcement Learning for LLM Reasoning
liked a model 2 months ago: meta-llama/Llama-3.2-3B