Tokenizers for linguistic families
WikiLangs
community
AI & ML interests
Wikilangs is an open-source initiative to democratize access to natural language processing models for every language represented on Wikipedia - A project by @OmarKamali. Graciously sponsored by Featherless.ai.
Recent Activity
models 327
wikilangs/ceb
Text Generation • Updated
wikilangs/es
Text Generation • Updated
wikilangs/en
Text Generation • Updated
wikilangs/it
Text Generation • Updated
wikilangs/fr
Text Generation • Updated
wikilangs/ary
Text Generation • Updated
wikilangs/shi
Text Generation • Updated
• 1
wikilangs/tokenizers_uralic-finnic
Updated
wikilangs/tokenizers_austronesian-malay
Updated
wikilangs/tokenizers_bantu-all
Updated
datasets 0
None public yet