Mingsong_Li's picture

1 7 2

Mingsong_Li

Mingsong07

https://lms-07.github.io/

lms-07

AI & ML interests

None yet

Recent Activity

liked a Space 2 months ago

HuggingFaceTB/smol-training-playbook

upvoted a paper 3 months ago

The Art of Scaling Reinforcement Learning Compute for LLMs

updated a collection 3 months ago

LLM-Reasoning-Data

View all activity

Organizations

None yet

upvoted 2 papers 3 months ago

The Art of Scaling Reinforcement Learning Compute for LLMs

Paper • 2510.13786 • Published Oct 15, 2025 • 31

RLAD: Training LLMs to Discover Abstractions for Solving Reasoning Problems

Paper • 2510.02263 • Published Oct 2, 2025 • 8

upvoted 3 papers 4 months ago

Evolving Language Models without Labels: Majority Drives Selection, Novelty Promotes Variation

Paper • 2509.15194 • Published Sep 18, 2025 • 33

MultiEdit: Advancing Instruction-based Image Editing on Diverse and Challenging Tasks

Paper • 2509.14638 • Published Sep 18, 2025 • 11

Sharing is Caring: Efficient LM Post-Training with Collective RL Experience Sharing

Paper • 2509.08721 • Published Sep 10, 2025 • 661

upvoted 2 papers 5 months ago

MoBE: Mixture-of-Basis-Experts for Compressing MoE-based LLMs

Paper • 2508.05257 • Published Aug 7, 2025 • 13

Grove MoE: Towards Efficient and Superior MoE LLMs with Adjugate Experts

Paper • 2508.07785 • Published Aug 11, 2025 • 28