Lewis Tunstall's picture

In a Training Loop 🔄

Lewis Tunstall PRO

lewtun

huggingface

·

https://lewtun.github.io/blog/

AI & ML interests

LLMs, LLMs, LLMs

Recent Activity

updated a model 5 minutes ago

hf-imo-colab/Qwen3-4B-Thinking-2507-Proof

updated a model 40 minutes ago

hf-imo-colab/Qwen3-4B-Thinking-2507-Proof

updated a model 44 minutes ago

hf-imo-colab/Qwen3-4B-Thinking-2507-Proof

View all activity

Organizations

lewtun 's models 288

lewtun/Qwen2.5-0.5B-SFT-LoRA

Updated Sep 30, 2024

lewtun/Llama-3.1-8B-SFT-LoRA-packing-no-lm-head

Updated Sep 30, 2024

lewtun/Llama-3.1-8B-SFT-LoRA-no-packing

Updated Sep 30, 2024

lewtun/Llama-3.1-8B-SFT-QLoRA-packing

Updated Sep 30, 2024

lewtun/Llama-3.1-8B-SFT-LoRA-packing-no-saved-modules

Updated Sep 30, 2024

lewtun/Llama-3.1-8B-SFT-LoRA-packing

Updated Sep 30, 2024

lewtun/Llama-3.1-8B-SFT-LoRA-packing-pad-token-eos

Updated Sep 30, 2024

lewtun/Llama-3.1-8B-SFT-QLoRA-packing-pad-token-eos

Updated Sep 30, 2024

lewtun/Llama-3.1-8B-SFT-full-packing

Text Generation • 8B • Updated Sep 30, 2024 • 5

lewtun/Llama-3.1-8B-SFT-LoRA

Updated Sep 27, 2024

lewtun/Qwen2-0.5B-Reward

Text Classification • 0.5B • Updated Sep 23, 2024 • 12

lewtun/gemma-2-2b-it-gkd-9b

Updated Sep 14, 2024

lewtun/gemma-2-2b-it-gkd-27b

Updated Sep 14, 2024

lewtun/gemma-2-2b-it-gkd

Updated Sep 14, 2024

lewtun/gemma-2-2b-gkd

Updated Sep 14, 2024

lewtun/tmp-dpo

Text Generation • 1.03M • Updated Sep 11, 2024 • 3

lewtun/dpo-model

Updated Sep 9, 2024

lewtun/dpo-model-lora

Updated Sep 9, 2024 • 2

lewtun/sft_openassistant-guanaco

Updated Sep 9, 2024

lewtun/reward-model

Text Classification • 0.5B • Updated Sep 5, 2024 • 10

lewtun/pythia-6.9b-deduped-tldr-online-dpo

7B • Updated Aug 28, 2024 • 5

lewtun/qwen2-1.5B-ultrafeedback-online-dpo

2B • Updated Aug 28, 2024 • 4

lewtun/qwen2-0.5B-ultrafeedback-online-dpo

0.6B • Updated Aug 28, 2024 • 5

lewtun/pythia-2.8b-deduped-tldr-online-dpo

3B • Updated Aug 27, 2024 • 3

lewtun/qwen2-7B-ultrafeedback-online-dpo-bs-1

Updated Aug 27, 2024

lewtun/qwen2-7B-ultrafeedback-online-dpo-bs-2

Updated Aug 27, 2024

lewtun/qwen2-7B-ultrafeedback-online-dpo

Updated Aug 27, 2024

lewtun/pythia-1b-deduped-tldr-online-dpo

1B • Updated Aug 27, 2024 • 5

lewtun/pythia-1b-tldr-online-dpo

Updated Aug 27, 2024

lewtun/qwen2-0.5B-lr-5e-7

Updated Aug 27, 2024