This collection is used to store series of models on the project Reinforcement Learning for Reflection Ability of Math Reasoning Model.
SII-BoHuang
SII-BoHuang
AI & ML interests
SII is an institution dedicated to innovation in education and research in the field of AI. Bo Huang is part of SII, focusing on LLM Agent.
Recent Activity
upvoted a paper 3 days ago
Hybrid Policy Distillation for LLMs updated a collection about 1 month ago
Project--Math Reasoning Model Reflection updated a collection about 1 month ago
Project--Math Reasoning Model ReflectionOrganizations
None yet