-
rghosh8/arc-grpo-deepseek-r1-distill-qwen-1.5b-rajat-seed-42-G-4-baseline
Text Generation • Updated • 14 -
rghosh8/arc-grpo-deepseek-llm-7b-chat-rajat-seed-42-G-4
Text Generation • Updated • 14 -
rghosh8/arc-grpo-deepseek-llm-7b-chat-rajat-seed-42-G-16
Text Generation • Updated • 14 -
rghosh8/arc-grpo-deepseek-llm-7b-chat-rajat-seed-3407-G-4
Text Generation • Updated • 19
Rajat Ghosh PRO
rghosh8
AI & ML interests
None yet
Recent Activity
updated a collection 26 minutes ago
Opencoder-GRPO updated a collection 26 minutes ago
Opencoder-GRPO updated a model 26 minutes ago
rghosh8/deepseek-llm-7b-chat-opencoder-educational-instruct-seed-42-G-4-REDUCED-LAYERS-new-params_mergedOrganizations
ARC-GRPO
-
rghosh8/arc-grpo-deepseek-r1-distill-qwen-1.5b-rajat-seed-42-G-4-new_merged
2B • Updated • 189 -
rghosh8/arc-grpo-deepseek-r1-distill-qwen-1.5b-rajat-seed-42-G-4-new
Text Generation • Updated • 13 • 1 -
rghosh8/arc-grpo-nemotron-mini-4b-instruct-rajat-seed-3407-G-4_merged
4B • Updated • 99 -
rghosh8/arc-grpo-nemotron-mini-4b-instruct-rajat-seed-3407-G-4
Text Generation • Updated • 16
arc-grpo-baseline
-
rghosh8/arc-grpo-deepseek-r1-distill-qwen-1.5b-rajat-seed-42-G-4-baseline
Text Generation • Updated • 14 -
rghosh8/arc-grpo-deepseek-llm-7b-chat-rajat-seed-42-G-4
Text Generation • Updated • 14 -
rghosh8/arc-grpo-deepseek-llm-7b-chat-rajat-seed-42-G-16
Text Generation • Updated • 14 -
rghosh8/arc-grpo-deepseek-llm-7b-chat-rajat-seed-3407-G-4
Text Generation • Updated • 19
ARC-GRPO
-
rghosh8/arc-grpo-deepseek-r1-distill-qwen-1.5b-rajat-seed-42-G-4-new_merged
2B • Updated • 189 -
rghosh8/arc-grpo-deepseek-r1-distill-qwen-1.5b-rajat-seed-42-G-4-new
Text Generation • Updated • 13 • 1 -
rghosh8/arc-grpo-nemotron-mini-4b-instruct-rajat-seed-3407-G-4_merged
4B • Updated • 99 -
rghosh8/arc-grpo-nemotron-mini-4b-instruct-rajat-seed-3407-G-4
Text Generation • Updated • 16
models 117
rghosh8/deepseek-llm-7b-chat-opencoder-educational-instruct-seed-42-G-4-REDUCED-LAYERS-new-params_merged
7B • Updated
rghosh8/deepseek-llm-7b-chat-opencoder-educational-instruct-seed-42-G-4-REDUCED-LAYERS-new-params
Text Generation • Updated
rghosh8/arc-grpo-deepseek-R1-distill-qwen-1.5b-rajat-seed-3407-G-16
Text Generation • Updated • 14
rghosh8/arc-grpo-deepseek-R1-distill-qwen-1.5b-rajat-seed-3407-G-4
Text Generation • Updated • 17
rghosh8/arc-grpo-deepseek-llm-7b-chat-rajat-seed-3407-G-16
Text Generation • Updated • 16
rghosh8/arc-grpo-deepseek-llm-7b-chat-rajat-seed-42-G-16
Text Generation • Updated • 14
rghosh8/arc-grpo-deepseek-r1-distill-qwen-1.5b-rajat-seed-42-G-4-baseline
Text Generation • Updated • 14
rghosh8/arc-grpo-nemotron-mini-4b-instruct-rajat-seed-42-G-4-REDUCED-LAYERS-2_merged
4B • Updated • 31
rghosh8/arc-grpo-nemotron-mini-4b-instruct-rajat-seed-42-G-4-REDUCED-LAYERS-2
Text Generation • Updated • 15
rghosh8/gsm8k-nemotron-mini-4b-instruct-rajat-seed-42-G-4-REDUCED-LAYERS-2_merged
4B • Updated • 13
datasets 5
rghosh8/math-lighteval-processed
Viewer • Updated • 7.5k • 8
rghosh8/Codegen_Code-Search-CDP_Benchmarking
Viewer • Updated • 9 • 16
rghosh8/supportGPT-v8
Viewer • Updated • 7.92k • 13 • 1
rghosh8/supportGPT-v2
Viewer • Updated • 8.17k • 10
rghosh8/supportGPT_data
Viewer • Updated • 149 • 15