Marco Cimolai

marco

AI & ML interests

None yet

Recent Activity

liked a model 24 days ago

LiquidAI/LFM2.5-VL-450M-Extract

Image-Text-to-Text • 0.4B • Updated 24 days ago • 4.57k • 49

liked a dataset 29 days ago

facebook/omnilingual-asr-corpus

Viewer • Updated Nov 14, 2025 • 548k • 3.83k • 206

liked a dataset about 1 month ago

infly/Infinity-Doc2-5M

Viewer • Updated 19 days ago • 130 • 1.64k • 16

liked a model about 1 month ago

NemoStation/Marlin-2B

Video-Text-to-Text • 2B • Updated about 1 month ago • 14.1k • 549

liked 2 models about 2 months ago

nvidia/Nemotron-3-Nano-Omni-30B-A3B-Reasoning-BF16

Any-to-Any • 33B • Updated May 8 • 828k • 365

openbmb/MiniCPM-o-4_5

Any-to-Any • 9B • Updated May 19 • 374k • 1.41k

liked a dataset about 2 months ago

NJU-LINK/OmniVideoBench

Viewer • Updated Apr 8 • 1k • 1.91k • 5

liked a model about 2 months ago

openbmb/MiniCPM-V-4.6

Image-Text-to-Text • 1B • Updated 25 days ago • 841k • 1.13k

upvoted an article 2 months ago

Training and Finetuning Multimodal Embedding & Reranker Models with Sentence Transformers

Article • tomaarsen • Apr 16 • 73

liked a dataset 4 months ago

bofenghuang/stt-pseudo-labeled-whisper-large-v3-multilingual

Updated Mar 19, 2025 • 22.5k • 4

upvoted an article 5 months ago

LightOnOCR-2-1B: a lightweight high-performance end-to-end OCR model family

Article • lightonai • Jan 19 • 96

liked a dataset 7 months ago

unstructuredio/SCORE-Bench

Viewer • Updated Dec 9, 2025 • 15.3k • 918 • 7

liked a model 7 months ago

tencent/HunyuanOCR

Image-Text-to-Text • 1.0B • Updated Jan 13 • 247k • 760

liked a model 8 months ago

yonigozlan/EdgeTAM-hf

Mask Generation • 13.9M • Updated Nov 6, 2025 • 6.53k • 72

liked a model 9 months ago

PaddlePaddle/PaddleOCR-VL

Image-Text-to-Text • 1.0B • Updated 2 days ago • 7.42k • 1.63k

liked a model 11 months ago

nvidia/parakeet-tdt-0.6b-v3

Automatic Speech Recognition • 0.6B • Updated about 3 hours ago • 142k • 954

upvoted an article about 1 year ago

Context Is Gold to Find the Gold Passage: Evaluating and Training Contextual Document Embeddings

Article • manu • Jun 2, 2025 • 28

liked a model about 1 year ago

Qwen/Qwen3-0.6B-FP8

Text Generation • 0.8B • Updated Jul 26, 2025 • 2M • 62

upvoted 2 collections over 1 year ago

Ovis2

Collection

Our latest advancement in multi-modal large language models (MLLMs) • 15 items • Updated Mar 25, 2025 • 67

Qwen2.5-VL

Collection

Vision-language model series based on Qwen2.5 • 10 items • Updated Mar 2 • 566
