Aleksei Dorkin PRO
adorkin
AI & ML interests
Computational Linguistics
Recent Activity
liked a model about 5 hours ago
nvidia/Gemma-4-31B-IT-NVFP4
published a dataset about 5 hours ago
adorkin/olmocr_science_pdfs-software_development
updated a dataset about 6 hours ago
adorkin/olmocr_science_pdfs-software_development
Organizations
Multilingual Text Embedding Models
tencent/KaLM-Embedding-Gemma3-12B-2511
Sentence Similarity • Updated • 24.6k • 91
nvidia/llama-embed-nemotron-8b
Feature Extraction • 8B • Updated • 41.2k • 158
Qwen/Qwen3-Embedding-8B
Feature Extraction • 8B • Updated • 1.94M • 670
Qwen/Qwen3-Embedding-4B
Feature Extraction • Updated • 1.82M • 259
Multilingual Text Encoders
spaces 6
Sleeping
1
NLI Zero Shot Classification
🔍
Zero-shot classification based on natural language inference
Sleeping
2
GliLem
🤓
Lemmatization disambiguation for Estonian with GliNER
Running
SigLIP2 + Clothes
🤔
Text-to-image clothing search using SigLIP2
Running
1
M-CLIP + Clothes
🦀
Text-to-image clothing search using multilingual CLIP
Sleeping
1
Tweet Emoji Predictor
🧐
Predict an emoji for your tweet (...your X?)
Sleeping
Sõnajaht Demo
🐠
Cross-lingual reverse dictionary
datasets 26
adorkin/olmocr_science_pdfs-software_development
Viewer • Updated • 2.21M
adorkin/olmocr_science_pdfs-games
Viewer • Updated • 157k • 128
adorkin/olmocr_science_pdfs-art_and_design
Viewer • Updated • 873k • 216
adorkin/olmocr_science_pdfs-software
Viewer • Updated • 735k • 287
adorkin/scientific-summaries-pubmed-open-access
Viewer • Updated • 270k • 46
adorkin/nemotron-code-student-teacher-10M
Viewer • Updated • 10M • 44
adorkin/fineweb2-vro
Viewer • Updated • 8.28k • 23
adorkin/finetranslations-vro-converted
Viewer • Updated • 8.2k • 26
adorkin/finetranslations-et-sample
Viewer • Updated • 260k • 31
adorkin/Ling-Coder-DPO-filtered
Viewer • Updated • 93.3k • 8