👋 Open to Work

Aritra Dutta

dutta18

22 17 18

https://vpnleaderboard.com/

AI & ML interests

None yet

Recent Activity

liked a model about 1 month ago

IDEA-Research/grounding-dino-base

upvoted an article about 1 month ago

Continuous batching for GRPO, now in TRL

upvoted an article about 1 month ago

Beyond LoRA: Can you beat the most popular fine-tuning technique?

View all activity

Organizations

liked a model about 1 month ago

IDEA-Research/grounding-dino-base

Zero-Shot Object Detection • 0.2B • Updated May 12, 2024 • 1.24M • 199

upvoted 2 articles about 1 month ago

Article

Continuous batching for GRPO, now in TRL

sergiopaniego

•

Jun 19

• 8

Article

Beyond LoRA: Can you beat the most popular fine-tuning technique?

BenjaminB, sayakpaul, hubnemo, kashif

•

Jun 18

• 81

upvoted a collection 3 months ago

LLaVa-NeXT

Collection

LLaVa-NeXT (also known as LLaVa-1.6) improves upon the 1.5 series by incorporating higher image resolutions and more reasoning/OCR datasets. • 8 items • Updated Jul 19, 2024 • 35

updated a dataset 3 months ago

dutta18/esnlive

Viewer • Updated Apr 15 • 129k • 214

published a dataset 3 months ago

dutta18/esnlive

Viewer • Updated Apr 15 • 129k • 214

upvoted an article 3 months ago

Article

Multimodal Embedding & Reranker Models with Sentence Transformers

tomaarsen

•

Apr 9

• 67

New activity in lmms-lab/DocVQA 4 months ago

DataFilesNotFoundError: No (supported) data files found in lmms-lab/DocVQA

#5 opened 4 months ago by

dutta18

liked a model 4 months ago

nanonets/Nanonets-OCR-s

Image-Text-to-Text • 4B • Updated Jun 20, 2025 • 7.09k • 1.59k

updated a dataset 4 months ago

dutta18/A-OKVQA-17K

Viewer • Updated Apr 2 • 18.2k • 121

published a dataset 4 months ago

dutta18/A-OKVQA-17K

Viewer • Updated Apr 2 • 18.2k • 121

updated a dataset 4 months ago

dutta18/Physical-Reasoning-VQA-45K

Viewer • Updated Apr 2 • 64.9k • 122

published a dataset 4 months ago

dutta18/Physical-Reasoning-VQA-45K

Viewer • Updated Apr 2 • 64.9k • 122

updated a dataset 4 months ago

dutta18/Quantity-Reasoning-VQA-23K

Viewer • Updated Apr 2 • 23.7k • 73

published a dataset 4 months ago

dutta18/Quantity-Reasoning-VQA-23K

Viewer • Updated Apr 2 • 23.7k • 73

upvoted a collection 4 months ago

Qwen2.5-VL

Collection

Vision-language model series based on Qwen2.5 • 10 items • Updated Mar 2 • 566

New activity in google/gemma-3-4b-it 5 months ago

Finetuning Code Link In Native PyTorch

#87 opened 5 months ago by

dutta18

liked a model 5 months ago

meta-llama/Llama-3.2-11B-Vision-Instruct

Image-Text-to-Text • 11B • Updated Dec 4, 2024 • 112k • 1.63k

New activity in mistralai/Ministral-3-3B-Instruct-2512 5 months ago

How to use local image in the chat template?

#15 opened 5 months ago by

dutta18

updated a dataset 6 months ago

dutta18/multidomain-VQA-with-cot-trace-9K

Viewer • Updated Feb 6 • 10.8k • 51

Aritra Dutta

AI & ML interests

Recent Activity

Organizations

dutta18's activity

Continuous batching for GRPO, now in TRL

Beyond LoRA: Can you beat the most popular fine-tuning technique?

Multimodal Embedding & Reranker Models with Sentence Transformers

DataFilesNotFoundError: No (supported) data files found in lmms-lab/DocVQA

Finetuning Code Link In Native PyTorch

How to use local image in the chat template?