👋 Open to Work

RDTvlokip PRO

RDTvlokip

https://rdtvlokip.fr

Recent Activity

posted an update 1 day ago

I finally changed the architecture of my 15M French LLM. It worked. Then I almost fooled myself about how much, and catching that was the real win.

After proving last time that architecture is a threshold, not a lever, I got stubborn: could I change how the model learns? Four honest attempts: Lion, a sharper AdamW β2, multi-token prediction, LayerScale. Four failures. The bottleneck wasn't the learning rule either.

So I changed the shape of the computation instead: loop the same transformer blocks 4× for deeper reasoning with zero added parameters. It beat the baseline on perplexity, the first thing in the whole project to move that number.

Then I added my own twist: let each token decide how deep to think, halting on its own entropy.

My first evaluation was spectacular. Coherence up 65%. Hallucinated names down 62%. It was noise. Eight prompts, one seed. I re-ran on 50 prompts × 200 tokens and watched the gains shrink to "modest", and on out-of-domain prompts, recurrence actually made things worse. No universal winner.

And none of it is new: it's Adaptive Computation Time (2016), the Universal Transformer (2018), and LoopViT (2026), recombined and measured honestly.

The real lesson:
A number from 8 prompts is a rumor.
The eval harness that kills your own best result is worth more than the result it kills.
Cite your lineage.
Stay preliminary until multiple seeds say otherwise.

The three models are live. The write-up is honest about every caveat 👇
🔗 https://huggingface.co/blog/RDTvlokip/teaching-a-15m-french-llm-to-think-deeper
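
For readers who want a feel for the mechanism in the post (looping shared blocks up to 4× and letting each token halt on its own entropy), here is a toy sketch in plain Python. It is not the author's implementation: `shared_block`, `loop_with_halting`, the 1.5 sharpening factor, and the 0.5 entropy threshold are all hypothetical choices made for illustration, and the "hidden state" is simplified to a vector of logits.

```python
import math


def softmax(logits):
    """Numerically stable softmax over a list of floats."""
    peak = max(logits)
    exps = [math.exp(x - peak) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def entropy(probs):
    """Shannon entropy in nats; zero-probability terms contribute nothing."""
    return -sum(p * math.log(p) for p in probs if p > 0.0)


def shared_block(logits):
    """Hypothetical stand-in for one pass through the shared blocks.

    The same function (the same "weights") is reused on every loop, so
    looping adds compute but no parameters. Here it only sharpens the
    distribution by scaling the logits.
    """
    return [1.5 * x for x in logits]


def loop_with_halting(logits, max_loops=4, threshold=0.5):
    """Apply shared_block up to max_loops times and stop early once the
    token's predictive entropy falls below threshold (an assumed rule)."""
    steps = 0
    for _ in range(max_loops):
        logits = shared_block(logits)
        steps += 1
        if entropy(softmax(logits)) < threshold:
            break
    return logits, steps


if __name__ == "__main__":
    # Two made-up tokens: one already confident, one nearly uniform.
    tokens = {"confident": [4.0, 0.0, 0.0], "uncertain": [0.5, 0.0, 0.0]}
    for name, logits in tokens.items():
        _, steps = loop_with_halting(logits)
        print(name, "halted after", steps, "loop(s)")
```

In this toy setup the confident token stops after fewer loops than the uncertain one, which is the intuition behind "let each token decide how deep to think". A real model would loop hidden states through transformer blocks and read the entropy from the output head, which this sketch does not attempt.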

upvoted an article 2 days ago

🔁 Apprendre à un LLM français de 15M à penser plus profond — et à savoir quand s'arrêter 🇫🇷

published an article 2 days ago

🔁 Apprendre à un LLM français de 15M à penser plus profond — et à savoir quand s'arrêter 🇫🇷

published an article 2 days ago

Article

🔁 Apprendre à un LLM français de 15M à penser plus profond — et à savoir quand s'arrêter 🇫🇷

RDTvlokip • 2 days ago • 1

published an article 2 days ago

Article

🔁 Teaching a 15M French LLM to think deeper — and to know when to stop 🇫🇷

RDTvlokip • 2 days ago • 1

published an article 5 days ago

Article

🔧 L'architecture est un seuil, pas un levier — ce que j'ai appris en optimisant un LLM français de 15M de paramètres 🇫🇷

RDTvlokip • 5 days ago • 1

published an article 5 days ago

Article

🔧 Architecture is a threshold, not a lever — what I learned optimizing a 15M French LLM 🇫🇷

RDTvlokip • 5 days ago • 1

published an article 2 months ago

Article

🧠 I trained my own French LLM from scratch — alone, with a 1080 Ti, and the power went out ⚡🇫🇷

RDTvlokip • May 5 • 6

published an article 2 months ago

Article

🧠 J'ai entraîné mon propre LLM français from scratch — seul, avec une 1080 Ti, et le courant a coupé ⚡🇫🇷

RDTvlokip • May 5 • 2

published an article 4 months ago

Article

🧲 Embeddings — When AI turns words into GPS coordinates! 📍🧠

RDTvlokip • Mar 9 • 1

published an article 4 months ago

Article

🧲 Embeddings — Quand l'IA transforme les mots en coordonnées GPS ! 📍🧠

RDTvlokip • Mar 9 • 1

published an article 5 months ago

Article

🎯 PCA (Principal Component Analysis) — Compresser les dimensions comme un boss ! 📊🔥

RDTvlokip • Feb 17 • 1

published an article 5 months ago

Article

🎯 PCA (Principal Component Analysis) — Compressing dimensions like a boss! 📊🔥

RDTvlokip • Feb 17 • 1

published an article 5 months ago

Article

🎯 K-Means — Quand l'IA organise le chaos en boîtes bien rangées ! 📦✨

RDTvlokip • Jan 29 • 1

published an article 5 months ago

Article

🎯 K-Means — When AI organizes chaos into neat boxes! 📦✨

RDTvlokip • Jan 29 • 1

published an article 6 months ago

Article

🎯 F1-Score — Quand l'Accuracy te ment en pleine face ! 📊💥

RDTvlokip • Jan 16 • 1

published an article 6 months ago

Article

🎯 F1-Score — When Accuracy lies to your face! 📊💥

RDTvlokip • Jan 16 • 1

published an article 6 months ago

Article

🎯 Precision & Recall — Les métriques jumelles qui ne sont jamais d'accord ! ⚖️🔍

RDTvlokip • Jan 8 • 1

published an article 6 months ago

Article

🎯 Precision & Recall — The twin metrics that never agree! ⚖️🔍

RDTvlokip • Jan 8 • 1

published an article 6 months ago

Article

🎆 AI 2026 — The 9 trends that will EXPLODE this year! 🚀💥

RDTvlokip • Jan 1 • 2

published an article 6 months ago

Article

🎆 IA 2026 — Les 9 tendances qui vont exploser cette année ! 🚀💥

RDTvlokip • Jan 1 • 2

published an article 6 months ago

Article

📊 Cross-Entropy — The loss function that KNOWS how to punish! 🎯🔥

RDTvlokip • Dec 29, 2025 • 1

published an article 6 months ago

Article

📊 Cross-Entropy — La fonction de perte qui SAIT punir ! 🎯🔥

RDTvlokip • Dec 29, 2025 • 2
