Hugging Face
In a Training Loop 🔄
205527.2 TFLOPS
Lewis Tunstall
PRO
lewtun
1285 followers · 127 following
https://lewtun.github.io/blog/
_lewtun
lewtun
AI & ML interests
LLMs, LLMs, LLMs
Recent Activity
Updated a model 5 minutes ago: hf-imo-colab/Qwen3-4B-Thinking-2507-Proof
Updated a model 40 minutes ago: hf-imo-colab/Qwen3-4B-Thinking-2507-Proof
Updated a model 44 minutes ago: hf-imo-colab/Qwen3-4B-Thinking-2507-Proof
lewtun's models (288), sorted by recently updated
lewtun/Qwen2.5-0.5B-SFT-LoRA • Updated Sep 30, 2024
lewtun/Llama-3.1-8B-SFT-LoRA-packing-no-lm-head • Updated Sep 30, 2024
lewtun/Llama-3.1-8B-SFT-LoRA-no-packing • Updated Sep 30, 2024
lewtun/Llama-3.1-8B-SFT-QLoRA-packing • Updated Sep 30, 2024
lewtun/Llama-3.1-8B-SFT-LoRA-packing-no-saved-modules • Updated Sep 30, 2024
lewtun/Llama-3.1-8B-SFT-LoRA-packing • Updated Sep 30, 2024
lewtun/Llama-3.1-8B-SFT-LoRA-packing-pad-token-eos • Updated Sep 30, 2024
lewtun/Llama-3.1-8B-SFT-QLoRA-packing-pad-token-eos • Updated Sep 30, 2024
lewtun/Llama-3.1-8B-SFT-full-packing • Text Generation • 8B • Updated Sep 30, 2024 • 5
lewtun/Llama-3.1-8B-SFT-LoRA • Updated Sep 27, 2024
lewtun/Qwen2-0.5B-Reward • Text Classification • 0.5B • Updated Sep 23, 2024 • 12
lewtun/gemma-2-2b-it-gkd-9b • Updated Sep 14, 2024
lewtun/gemma-2-2b-it-gkd-27b • Updated Sep 14, 2024
lewtun/gemma-2-2b-it-gkd • Updated Sep 14, 2024
lewtun/gemma-2-2b-gkd • Updated Sep 14, 2024
lewtun/tmp-dpo • Text Generation • 1.03M • Updated Sep 11, 2024 • 3
lewtun/dpo-model • Updated Sep 9, 2024
lewtun/dpo-model-lora • Updated Sep 9, 2024 • 2
lewtun/sft_openassistant-guanaco • Updated Sep 9, 2024
lewtun/reward-model • Text Classification • 0.5B • Updated Sep 5, 2024 • 10
lewtun/pythia-6.9b-deduped-tldr-online-dpo • 7B • Updated Aug 28, 2024 • 5
lewtun/qwen2-1.5B-ultrafeedback-online-dpo • 2B • Updated Aug 28, 2024 • 4
lewtun/qwen2-0.5B-ultrafeedback-online-dpo • 0.6B • Updated Aug 28, 2024 • 5
lewtun/pythia-2.8b-deduped-tldr-online-dpo • 3B • Updated Aug 27, 2024 • 3
lewtun/qwen2-7B-ultrafeedback-online-dpo-bs-1 • Updated Aug 27, 2024
lewtun/qwen2-7B-ultrafeedback-online-dpo-bs-2 • Updated Aug 27, 2024
lewtun/qwen2-7B-ultrafeedback-online-dpo • Updated Aug 27, 2024
lewtun/pythia-1b-deduped-tldr-online-dpo • 1B • Updated Aug 27, 2024 • 5
lewtun/pythia-1b-tldr-online-dpo • Updated Aug 27, 2024
lewtun/qwen2-0.5B-lr-5e-7 • Updated Aug 27, 2024