-
Aligning Teacher with Student Preferences for Tailored Training Data Generation
Paper • 2406.19227 • Published • 25 -
Pre-training Distillation for Large Language Models: A Design Space Exploration
Paper • 2410.16215 • Published • 17 -
Baichuan Alignment Technical Report
Paper • 2410.14940 • Published • 51 -
MiniPLM: Knowledge Distillation for Pre-Training Language Models
Paper • 2410.17215 • Published • 16
By
ByRookie
AI & ML interests
None yet
Recent Activity
liked
a model 1 day ago
miromind-ai/MiroThinker-1.7 upvoted a paper 4 months ago
DR Tulu: Reinforcement Learning with Evolving Rubrics for Deep Research liked
a Space 4 months ago
HuggingFaceTB/smol-training-playbook