Collection of all datasets for the Pnar language personally curated by organisation members.
AI & ML interests
Natural Language Processing for low-resource indigenous languages of Meghalaya
Organization Card
Tynrai
Tynrai (Tynrai-AI) is an initiative dedicated to the preservation of language through technology. We focus on digitizing, documenting, and revitalizing the indigenous languages of Meghalaya, India.
We build, curate, and release datasets and models including conversational agents that prioritize real-world impact for the Khasic and Garo languages.
Mission
- Preserve and digitize indigenous languages
- Research in low-resource NLP
- Build high-quality datasets and reproducible models
Areas of Focus
- Neural Machine Translation (NMT): Specializing in Khasi, Garo, and Pnar
- Automatic Speech Recognition (ASR): Speech-to-text for indigenous languages
- Text-to-Speech (TTS): Natural speech generation for local languages
- Conversational AI: Chat Bots and dialogue systems
- Language Preservation: Documentation & corpus creation
What You’ll Find Here
- Chat Bots: Interactive conversational agents for learning and assistance
- Datasets: Parallel corpora, annotated text, speech resources, and QA Datasets
- Models: Fine-tuned and experimental NLP models
- Spaces: Demos and interactive experiments
Contact
For collaboration or questions: - Hugging Face Discussions
Low-resource does not mean low-impact.
spaces 5
Running
Ri-Gemma-Instruct-GGUF
☁
Chat with Ri: The Khasi language AI assistant by Tynrai AI.
Paused
Agents
English To Pnar Translator
📖
English-Pnar Translation demo using a Fine-tuned NLLB model
Sleeping
Agents
Khasi Spell Checker
⚡
A simple statistical spell-checker for the Khasi language
Sleeping
Bapyn NLLB En Kha
👁
This is a Bi-Directional English-Khasi translation model.
models 0
None public yet
datasets 0
None public yet