This collection hosts the models and datasets released as part of Pula, the first suite of LLMs for Setswana. Previously BOTS-LM.
Nathan Brown
OxxoCodes
AI & ML interests
Model compression & LLM development
Recent Activity
liked a model 2 days ago
nvidia/Gemma-4-31B-IT-NVFP4 liked a model 2 days ago
0xSero/gemma-4-21b-a4b-it-REAP liked a model 4 months ago
jhu-clsp/mmBERT-smallOrganizations
Pula
This collection hosts the models and datasets released as part of Pula, the first suite of LLMs for Setswana. Previously BOTS-LM.
Distilled Long-Context Encoders
Various efficient attention encoder-style architectures distilled into student models with half the hidden layers, plus a long-context NER dataset
models 17
OxxoCodes/Pula-14B
Text Generation • 15B • Updated • 5 • 1
OxxoCodes/Pula-8B
Text Generation • 8B • Updated • 7 • 2
OxxoCodes/Pula-1B
Text Generation • 1B • Updated • 6 • 1
OxxoCodes/Pula-3B
Text Generation • 3B • Updated • 6 • 1
OxxoCodes/distil-SmolLM2-135M-Instruct
Text Generation • 0.1B • Updated • 4
OxxoCodes/InkubaLM-Instruct-test
Updated • 1
OxxoCodes/Pula-XLMR-large-v0.1
Fill-Mask • 0.6B • Updated • 1 • 1
OxxoCodes/Pula-8B-v0.1
Text Generation • 8B • Updated • 19 • 4
OxxoCodes/Meta-Llama-3-70B-Instruct-GPTQ
Text Generation • Updated • 5 • 2
OxxoCodes/Meta-Llama-3-8B-Instruct-GPTQ
Text Generation • Updated • 1
datasets 11
OxxoCodes/maps
Viewer • Updated • 250 • 15
OxxoCodes/gsm8k-tsn
Viewer • Updated • 1.32k • 10
OxxoCodes/fineweb-10MT
Viewer • Updated • 14.9k • 4
OxxoCodes/Marothodi
Viewer • Updated • 152k • 11 • 1
OxxoCodes/Medupi
Viewer • Updated • 976k • 9
OxxoCodes/Stawberry
Viewer • Updated • 387k • 15 • 1
OxxoCodes/pulabert-dataset
Viewer • Updated • 2.06M • 24
OxxoCodes/mmlu-tsn
Viewer • Updated • 14k • 8
OxxoCodes/gpt4o-setswana-instruct
Viewer • Updated • 1.58k • 22
OxxoCodes/gpt4o-setswana
Viewer • Updated • 1.58k • 6