~1.69M raw Swahili text samples from news, government, education, and legal domains, ideal for LLM pretraining and unsupervised NLP research.
Samwel Ngusa
ngusadeep
·
AI & ML interests
None yet
Recent Activity
updated a Space 5 days ago
lengai-ai/README published a Space 5 days ago
lengai-ai/README updated a dataset 5 days ago
lengai-ai/Swahili-FineTome-Dataset