Chanuk Lee
tally0818
AI & ML interests
LLM post-training
Recent Activity
upvoted a paper about 4 hours ago
The Unlearnability Phenomenon in RLVR for Language Models upvoted a paper about 4 hours ago
You Only Need Minimal RLVR Training: Extrapolating LLMs via Rank-1 Trajectories upvoted a paper about 6 hours ago
It Takes Two: Complementary Self-Distillation for Contextual Integrity in LLMsOrganizations
None yet