🔄 In a Training Loop

Keylhan Paumard--André

keypa

5 31 295

keypaa

AI & ML interests

Efficient deep learning, LLM fine-tuning, inference optimization, model compression, distributed training, GPU systems, open-source AI infrastructure

Recent Activity

upvoted an article 1 day ago

Kimi K3 Model Overview: 2.8T Parameters, MXFP4 Quantization, and What the Open Weights Mean for the Community

liked a dataset 1 day ago

greghavens/kimi-k3-coding-and-debugging-traces

liked a model 2 days ago

baseten/glm-52-debug

Organizations

Collections 1

models 20

datasets 13

keypa/reap-calibration-v1-filtered

Viewer • Updated 22 days ago • 21k • 49

keypa/reap-calibration-v1-full

Viewer • Updated 22 days ago • 23.1k • 43

keypa/reaper-calibration

Viewer • Updated Jun 9 • 1.09M • 49 • 1

keypa/qwen36-adapter-vision-sft

Viewer • Updated May 5 • 10k • 5

keypa/qwen36-adapter-longcontext-sft

Viewer • Updated May 5 • 5.09k • 15

keypa/qwen36-adapter-cyber-sft

Viewer • Updated May 5 • 52 • 3

keypa/qwen36-adapter-medical-sft

Viewer • Updated May 5 • 16.2k • 4

keypa/qwen36-adapter-math-grpo

Viewer • Updated May 5 • 50k • 4

keypa/qwen36-adapter-code-sft

Viewer • Updated May 5 • 100k • 6

keypa/gpt-oss-calibration-data

Viewer • Updated Jan 18 • 15.6k • 45

Collections 1

The Art of Scaling Reinforcement Learning Compute for LLMs

Attention Is All You Need for KV Cache in Diffusion LLMs

BitNet Distillation

GigaBrain-0: A World Model-Powered Vision-Language-Action Model

models 20

keypa/oracle-gemma4-12b-lora

keypa/oracle-gemma4-12b-GGUF

keypa/oracle-gemma4-12b

keypa/soren-9b-stage-0-0_identity_v1

keypa/soren-9b-stage-1-1_reasoning_warmup

keypa/qwen36-27b-adapters-suite

keypa/qwen36-27b-adapter-math-grpo

keypa/qwen36-27b-adapter-vision-sft

keypa/qwen36-27b-adapter-cyber-sft

keypa/qwen36-27b-adapter-code-sft
