Hongmutian

OriReplication

7 7

·

AI & ML interests

None yet

Recent Activity

upvoted a paper 2 days ago

LongStraw: Long-Context RL Beyond 2M Tokens under a Fixed GPU Budget

liked a dataset 17 days ago

nvidia/Nemotron-SFT-ARC-AGI-v1

liked a model about 1 month ago

mindlab-research/Macaron-V1-Preview-749B

View all activity

Organizations

None yet

upvoted a paper 2 days ago

LongStraw: Long-Context RL Beyond 2M Tokens under a Fixed GPU Budget

Paper • 2607.14952 • Published 4 days ago • 174

upvoted 2 papers about 2 months ago

On the Scaling of PEFT: Towards Million Personal Models of Trillion Parameters

Paper • 2606.02437 • Published Jun 1 • 239

Macaron-A2UI: A Model for Generative UI in Personal Agents

Paper • 2605.24830 • Published May 24 • 84

upvoted 2 papers 2 months ago

MinT: Managed Infrastructure for Training and Serving Millions of LLMs

Paper • 2605.13779 • Published May 13 • 224

δ-mem: Efficient Online Memory for Large Language Models

Paper • 2605.12357 • Published May 12 • 132

upvoted a paper 3 months ago

Rethinking On-Policy Distillation of Large Language Models: Phenomenology, Mechanism, and Recipe

Paper • 2604.13016 • Published Apr 14 • 114

upvoted a paper 9 months ago

DeepPrune: Parallel Scaling without Inter-trace Redundancy

Paper • 2510.08483 • Published Oct 9, 2025 • 24