Cola Chen (SII)'s picture

Cola Chen (SII)

141forever

·

https://141forever.github.io/

141forever

AI & ML interests

None yet

Recent Activity

new activity 2 months ago

HuggingFaceH4/on-policy-distillation:How to reproduce the results in your blog?

liked a Space 3 months ago

HuggingFaceH4/on-policy-distillation

new activity 3 months ago

HuggingFaceH4/on-policy-distillation:About lr and evaluation

View all activity

Organizations

New activity in HuggingFaceH4/on-policy-distillation 2 months ago

How to reproduce the results in your blog?

#7 opened 3 months ago by

liked a Space 3 months ago

Unlocking On-Policy Distillation for Any Model Family

Visualize on-policy distillation for any model family

New activity in HuggingFaceH4/on-policy-distillation 3 months ago

About lr and evaluation

#6 opened 4 months ago by

upvoted an article 5 months ago

Article

Apriel-H1: The Surprising Key to Distilling Efficient Reasoning Models

Nov 19, 2025

•

34

upvoted a paper 7 months ago

Information Gain-based Policy Optimization: A Simple and Effective Approach for Multi-Turn LLM Agents

Paper • 2510.14967 • Published Oct 16, 2025 • 34