Rajat Ghosh PRO
rghosh8
AI & ML interests
None yet
Recent Activity
updated a collection about 2 hours ago
ROBOT-OpenVLA
updated a collection about 2 hours ago
ROBOT-OpenVLA
updated a model about 2 hours ago
rghosh8/openvla-7b-libero-spatial
Organizations
ARC-GRPO
- rghosh8/arc-grpo-deepseek-r1-distill-qwen-1.5b-rajat-seed-42-G-4-new_merged
  2B • Updated • 210
- rghosh8/arc-grpo-deepseek-r1-distill-qwen-1.5b-rajat-seed-42-G-4-new
  Text Generation • Updated • 13 • 1
- rghosh8/arc-grpo-nemotron-mini-4b-instruct-rajat-seed-3407-G-4_merged
  4B • Updated • 96
- rghosh8/arc-grpo-nemotron-mini-4b-instruct-rajat-seed-3407-G-4
  Text Generation • Updated • 3
GSM8k-GRPO
- rghosh8/gsm8k-deepseek-llm-7b-chat-rajat-seed-42-G-16
  Text Generation • Updated • 12
- rghosh8/gsm8k-deepseek-llm-7b-chat-rajat-seed-42-G-16_merged
  Text Generation • 7B • Updated • 2.15k
- rghosh8/gsm8k-deepseek-llm-7b-chat-rajat-seed-3407-G-16
  Text Generation • Updated • 6
- rghosh8/gsm8k-deepseek-llm-7b-chat-rajat-seed-3407-G-16_merged
  Text Generation • 7B • Updated • 102
arc-grpo-baseline
- rghosh8/arc-grpo-deepseek-r1-distill-qwen-1.5b-rajat-seed-42-G-4-baseline
  Text Generation • Updated • 16
- rghosh8/arc-grpo-deepseek-llm-7b-chat-rajat-seed-42-G-4
  Text Generation • Updated • 2
- rghosh8/arc-grpo-deepseek-llm-7b-chat-rajat-seed-42-G-16
  Text Generation • Updated • 17
- rghosh8/arc-grpo-deepseek-llm-7b-chat-rajat-seed-3407-G-4
  Text Generation • Updated • 2
Opencoder-GRPO
- rghosh8/deepseek-r1-distill-qwen-1.5b-opencoder-educational-instruct-seed-42-G-4-merged
  2B • Updated • 102
- rghosh8/deepseek-r1-distill-qwen-1.5b-opencoder-educational-instruct-seed-42-G-4
  Text Generation • Updated • 15
- rghosh8/deepseek-r1-distill-qwen-1.5b-opencoder-educational-instruct-seed-3407-G-8
  Text Generation • Updated • 14
- rghosh8/deepseek-r1-distill-qwen-1.5b-opencoder-educational-instruct-seed-3407-G-8_merged
  2B • Updated • 86
ROBOT-OpenVLA