Train one epoch SFT on UltraChat200K
Zizhuo Zhang PRO
resistz
AI & ML interests
None yet
Recent Activity
upvoted a paper about 3 hours ago
Rethinking How to Remember: Beyond Atomic Facts in Lifelong LLM Agent Memory
updated a model 5 months ago
resistz/GT-GRPO_Llama-3.2-3B-Instruct_NQ-HotpotQA
published a model 5 months ago
resistz/GT-GRPO_Llama-3.2-3B-Instruct_NQ-HotpotQA