Ziheng Li
ChillingDream
AI & ML interests
Natural Language Processing
Recent Activity
authored a paper 2 days ago
To Mix or To Merge: Toward Multi-Domain Reinforcement Learning for Large Language Models
authored a paper 2 days ago
LiveClawBench: Benchmarking LLM Agents on Complex, Real-World Assistant Tasks
authored a paper 2 days ago
Trust Region On-Policy Distillation
Organizations
None yet