In a Training Loop 🔄

1 3 27

Michael Benayoun

michaelbenayoun

AI & ML interests

None yet

Recent Activity

updated a model about 1 month ago

aws-neuron/optimum-neuron-cache

upvoted an article 2 months ago

DeepSeek-R1 Dissection: Understanding PPO & GRPO Without Any Prior Reinforcement Learning Knowledge

updated a model 3 months ago

optimum-internal-testing/optimum-neuron-cache-ci

View all activity

Organizations

updated a model about 1 month ago

aws-neuron/optimum-neuron-cache

Updated about 15 hours ago • 29

upvoted an article 2 months ago

Article

DeepSeek-R1 Dissection: Understanding PPO & GRPO Without Any Prior Reinforcement Learning Knowledge

Feb 7, 2025

•

276

updated a model 3 months ago

optimum-internal-testing/optimum-neuron-cache-ci

Updated about 4 hours ago

updated 2 models 4 months ago

michaelbenayoun/qwen3-tiny-4kv-heads-8layers-random

Text Generation • 6.61M • Updated Oct 30, 2025 • 2

michaelbenayoun/qwen3-tiny-4kv-heads-4layers-random

Text Generation • 5.47M • Updated Oct 30, 2025 • 16.3k

liked a model 6 months ago

xai-org/grok-2

Updated Nov 5, 2025 • 17.3k • 1.03k

updated a dataset 7 months ago

huggingface/documentation-images

Viewer • Updated about 17 hours ago • 59 • 1.9M • 109

liked a dataset 7 months ago

huggingface/documentation-images

Viewer • Updated about 17 hours ago • 59 • 1.9M • 109

updated a model 7 months ago

michaelbenayoun/deepseekv3-tiny-4kv-heads-4-layers-random

Text Generation • 5.27M • Updated Jul 24, 2025 • 3

published a model 7 months ago

michaelbenayoun/deepseekv3-tiny-4kv-heads-4-layers-random

Text Generation • 5.27M • Updated Jul 24, 2025 • 3

upvoted an article 7 months ago

Article

Creating custom kernels for the AMD MI300

Jul 9, 2025

•

updated a model 8 months ago

michaelbenayoun/granite-tiny-4kv-heads-4layers-random

Text Generation • 4.2M • Updated Jun 18, 2025 • 507

published 3 models 8 months ago

updated 3 models 9 months ago

michaelbenayoun/lora-qkv-included-llama-2-tiny-4kv-heads-4layers-random

Updated Jun 2, 2025

michaelbenayoun/lora-2-qkv-included-llama-2-tiny-4kv-heads-4layers-random

Updated Jun 2, 2025

michaelbenayoun/llama-2-tiny-4kv-heads-4layers-random

Text Generation • 8.54M • Updated Jun 2, 2025 • 63k

published a model 9 months ago

michaelbenayoun/lora-2-qkv-included-llama-2-tiny-4kv-heads-4layers-random

Updated Jun 2, 2025

updated a model 9 months ago

michaelbenayoun/lora-qkv-included-llama-2-tiny-4kv-heads-4layers-random

Updated Jun 2, 2025

Michael Benayoun

AI & ML interests

Recent Activity

Organizations

michaelbenayoun's activity

DeepSeek-R1 Dissection: Understanding PPO & GRPO Without Any Prior Reinforcement Learning Knowledge

Creating custom kernels for the AMD MI300