-
rghosh8/arc-grpo-deepseek-r1-distill-qwen-1.5b-rajat-seed-42-G-4-new_merged
2B • Updated • 179 -
rghosh8/arc-grpo-deepseek-r1-distill-qwen-1.5b-rajat-seed-42-G-4-new
Text Generation • Updated • 13 • 1 -
rghosh8/arc-grpo-nemotron-mini-4b-instruct-rajat-seed-3407-G-4_merged
4B • Updated • 91 -
rghosh8/arc-grpo-nemotron-mini-4b-instruct-rajat-seed-3407-G-4
Text Generation • Updated • 16
Rajat Ghosh PRO
rghosh8
AI & ML interests
None yet
Recent Activity
updated a collection 1 day ago
ARC-GRPO updated a model 1 day ago
rghosh8/arc-grpo-nemotron-mini-4b-instruct-rajat-seed-42-G-4-REDUCED-LAYERS-2_merged updated a collection 1 day ago
ARC-GRPOOrganizations
GSM8k-GRPO
-
rghosh8/gsm8k-deepseek-llm-7b-chat-rajat-seed-42-G-16
Text Generation • Updated • 12 -
rghosh8/gsm8k-deepseek-llm-7b-chat-rajat-seed-42-G-16_merged
Text Generation • 7B • Updated • 2.14k -
rghosh8/gsm8k-deepseek-llm-7b-chat-rajat-seed-3407-G-16
Text Generation • Updated • 6 -
rghosh8/gsm8k-deepseek-llm-7b-chat-rajat-seed-3407-G-16_merged
Text Generation • 7B • Updated • 100
Opencoder-GRPO
-
rghosh8/deepseek-r1-distill-qwen-1.5b-opencoder-educational-instruct-seed-42-G-4-merged
2B • Updated • 32 -
rghosh8/deepseek-r1-distill-qwen-1.5b-opencoder-educational-instruct-seed-42-G-4
Text Generation • Updated • 15 -
rghosh8/deepseek-r1-distill-qwen-1.5b-opencoder-educational-instruct-seed-3407-G-8
Text Generation • Updated • 12 -
rghosh8/deepseek-r1-distill-qwen-1.5b-opencoder-educational-instruct-seed-3407-G-8_merged
2B • Updated • 65
ARC-GRPO
-
rghosh8/arc-grpo-deepseek-r1-distill-qwen-1.5b-rajat-seed-42-G-4-new_merged
2B • Updated • 179 -
rghosh8/arc-grpo-deepseek-r1-distill-qwen-1.5b-rajat-seed-42-G-4-new
Text Generation • Updated • 13 • 1 -
rghosh8/arc-grpo-nemotron-mini-4b-instruct-rajat-seed-3407-G-4_merged
4B • Updated • 91 -
rghosh8/arc-grpo-nemotron-mini-4b-instruct-rajat-seed-3407-G-4
Text Generation • Updated • 16
Opencoder-GRPO
-
rghosh8/deepseek-r1-distill-qwen-1.5b-opencoder-educational-instruct-seed-42-G-4-merged
2B • Updated • 32 -
rghosh8/deepseek-r1-distill-qwen-1.5b-opencoder-educational-instruct-seed-42-G-4
Text Generation • Updated • 15 -
rghosh8/deepseek-r1-distill-qwen-1.5b-opencoder-educational-instruct-seed-3407-G-8
Text Generation • Updated • 12 -
rghosh8/deepseek-r1-distill-qwen-1.5b-opencoder-educational-instruct-seed-3407-G-8_merged
2B • Updated • 65
GSM8k-GRPO
-
rghosh8/gsm8k-deepseek-llm-7b-chat-rajat-seed-42-G-16
Text Generation • Updated • 12 -
rghosh8/gsm8k-deepseek-llm-7b-chat-rajat-seed-42-G-16_merged
Text Generation • 7B • Updated • 2.14k -
rghosh8/gsm8k-deepseek-llm-7b-chat-rajat-seed-3407-G-16
Text Generation • Updated • 6 -
rghosh8/gsm8k-deepseek-llm-7b-chat-rajat-seed-3407-G-16_merged
Text Generation • 7B • Updated • 100