12 11

yyx

RuggingHace

AI & ML interests

None yet

Recent Activity

upvoted an article 27 days ago

Custom Kernels for All from Codex and Claude

liked a model 27 days ago

MiniMaxAI/MiniMax-M2.5

upvoted an article about 1 month ago

Training Design for Text-to-Image Models: Lessons from Ablations

View all activity

Organizations

None yet

upvoted an article 27 days ago

Article

Custom Kernels for All from Codex and Claude

29 days ago

•

liked a model 27 days ago

MiniMaxAI/MiniMax-M2.5

Text Generation • 229B • Updated 3 days ago • 493k • • 1.18k

upvoted 2 articles about 1 month ago

Article

Training Design for Text-to-Image Models: Lessons from Ablations

Feb 3

•

Article

Text-to-image Architectural Experiments

Nov 13, 2025

•

upvoted a paper about 1 month ago

Towards Scalable Pre-training of Visual Tokenizers for Generation

Paper • 2512.13687 • Published Dec 15, 2025 • 106

upvoted an article about 1 month ago

Article

Architectural Choices in China's Open-Source AI Ecosystem: Building Beyond DeepSeek

Jan 27

•

upvoted an article about 2 months ago

Article

Unlocking Agentic RL Training for GPT-OSS: A Practical Retrospective

Jan 27

•

liked a model 2 months ago

MiniMaxAI/MiniMax-M2.1

Text Generation • 229B • Updated 28 days ago • 58.1k • • 1.27k

upvoted an article 4 months ago

Article

You could have designed state of the art positional encoding

Nov 25, 2024

•

454

liked a Space 4 months ago

Scaling FineWeb to 1000+ languages: Step 1: finding signal in 100s of evaluation tasks

📝

Evaluate multilingual models using FineTasks

liked a model 4 months ago

bigscience/bloom

Text Generation • 176B • Updated Jul 28, 2023 • 6.85k • 4.99k

liked 2 Spaces 4 months ago

FineWeb: decanting the web for the finest text data at scale

🍷

1.31k

Generate a curated web‑text dataset for LLM training

The Smol Training Playbook

📚

3.04k

The secrets to building world-class LLMs

liked a model 5 months ago

MiniMaxAI/MiniMax-M2

Text Generation • 229B • Updated Dec 23, 2025 • 198k • • 1.49k

upvoted an article 5 months ago

Article

What is test-time compute and how to scale it?

Feb 6, 2025

•

117

upvoted an article 6 months ago

Article

Make your ZeroGPU Spaces go brrr with ahead-of-time compilation

Sep 2, 2025

•

upvoted an article 7 months ago

Article

From Zero to GPU: A Guide to Building and Scaling Production-Ready CUDA Kernels

Aug 18, 2025

•

liked a model 8 months ago

RedHatAI/quantization

Updated Jul 27, 2025 • 6

upvoted an article 8 months ago

Article

SmolLM3: smol, multilingual, long-context reasoner

Jul 8, 2025

•

764

upvoted a paper 9 months ago

MiniMax-M1: Scaling Test-Time Compute Efficiently with Lightning Attention

Paper • 2506.13585 • Published Jun 16, 2025 • 273

yyx

AI & ML interests

Recent Activity

Organizations

RuggingHace's activity

Custom Kernels for All from Codex and Claude

Training Design for Text-to-Image Models: Lessons from Ablations

Text-to-image Architectural Experiments

Architectural Choices in China's Open-Source AI Ecosystem: Building Beyond DeepSeek

Unlocking Agentic RL Training for GPT-OSS: A Practical Retrospective

You could have designed state of the art positional encoding

Scaling FineWeb to 1000+ languages: Step 1: finding signal in 100s of evaluation tasks

FineWeb: decanting the web for the finest text data at scale

The Smol Training Playbook

What is test-time compute and how to scale it?

Make your ZeroGPU Spaces go brrr with ahead-of-time compilation

From Zero to GPU: A Guide to Building and Scaling Production-Ready CUDA Kernels

SmolLM3: smol, multilingual, long-context reasoner