-
tartuNLP/Llama-3.1-EstLLM-8B-Instruct-1125
Text Generation • 8B • Updated • 449 • • 6 -
tartuNLP/Llama-3.1-EstLLM-8B-Instruct-0825
Text Generation • 8B • Updated • 7 • 2 -
tartuNLP/Llama-3.1-EstLLM-8B-0525
Text Generation • 8B • Updated • 21 • -
EstLLM: Enhancing Estonian Capabilities in Multilingual LLMs via Continued Pretraining and Post-Training
Paper • 2603.02041 • Published
AI & ML interests
None defined yet.
Recent Activity
View all activity
Papers
EstLLM: Enhancing Estonian Capabilities in Multilingual LLMs via Continued Pretraining and Post-Training
Estonian WinoGrande Dataset: Comparative Analysis of LLM Performance on Human and Machine Translation
Organization Card
We are the research group of natural language processing at the Institute of Computer Science, University of Tartu. Our areas of focus include machine translation, speech synthesis, NLP for Estonian and others.
-
tartuNLP/Llama-3.1-EstLLM-8B-Instruct-1125
Text Generation • 8B • Updated • 449 • • 6 -
tartuNLP/Llama-3.1-EstLLM-8B-Instruct-0825
Text Generation • 8B • Updated • 7 • 2 -
tartuNLP/Llama-3.1-EstLLM-8B-0525
Text Generation • 8B • Updated • 21 • -
EstLLM: Enhancing Estonian Capabilities in Multilingual LLMs via Continued Pretraining and Post-Training
Paper • 2603.02041 • Published
A collection of resources for evaluation of LLM capabilities in the Estonian language.
models 76
tartuNLP/est-roberta-vm-morph-homonym-tagging
Token Classification • 0.1B • Updated • 17
tartuNLP/Llammas-base-p1-GPT-4o-human-error-mix-paragraph-GEC
Text Generation • 7B • Updated • 32.1k •
tartuNLP/Apertus-EstLLM-8B-Instruct-0326
Text Generation • 8B • Updated • 105
tartuNLP/est-roberta-hist-ner
Token Classification • 0.1B • Updated • 8
tartuNLP/est-roberta-hist-ner-for-tccp
Token Classification • 0.1B • Updated • 8
tartuNLP/est-roberta-vm-morph-tagging
Token Classification • 0.1B • Updated • 8
tartuNLP/Apertus-EstLLM-8B-1125
Text Generation • 8B • Updated • 9
tartuNLP/Llama-3.1-EstLLM-8B-0525
Text Generation • 8B • Updated • 21 •
tartuNLP/Llama-3.1-EstLLM-8B-Instruct-1125
Text Generation • 8B • Updated • 449 • • 6
tartuNLP/Apertus-EstLLM-8B-Instruct-1125
Text Generation • 8B • Updated • 6 • 1
datasets 37
tartuNLP/SynEstParallel
Viewer • Updated • 1.61B • 203
tartuNLP/winogrande_et
Viewer • Updated • 8.36k • 308 • 1
tartuNLP/sib-smugri
Viewer • Updated • 3.1k • 418
tartuNLP/belebele-smugri
Viewer • Updated • 381 • 126
tartuNLP/wikipedia-smugri-20251201
Viewer • Updated • 1.47M • 138
tartuNLP/smugri4-data
Updated • 100 • 1
tartuNLP/EstSpeechMT
Updated • 5
tartuNLP/Estonian_Subjectivity
Viewer • Updated • 1k • 730
tartuNLP/finepdfs-et
Viewer • Updated • 554k • 101
tartuNLP/finetranslations-et
Viewer • Updated • 10M • 610