RL Training Sets - a pittawat Collection

updated Feb 14

POLARIS-Project/Polaris-Dataset-53K

Viewer • Updated Jun 18, 2025 • 53.3k • 801 • 34

Note: 53K Math Dataset
- AIME
- AMC
- Omni-MATH
- STILL
- AReaL-boba

agentica-org/DeepCoder-Preview-Dataset

Viewer • Updated Apr 9, 2025 • 25k • 2.89k • 99

Note: 25K Code Dataset (train splits)
- LCBv5
- PrimeIntellect’s SYNTHETIC-1
- TACO Verified

miromind-ai/MiroRL-GenQA

Viewer • Updated Aug 11, 2025 • 13.1k • 91 • 12

Note: 12.3K Synthesized QA Dataset (train split)
- by GPT-4.1

allenai/Dolci-Think-RL-7B

Viewer • Updated Jan 5 • 102k • 2.04k • 16

Note: 102K Curated Dataset
- Precise IF
- Math
- Coding
- General Chat

nvidia/Nemotron-3-Nano-RL-Training-Blend

Preview • Updated Dec 15, 2025 • 547 • 21

open-thoughts/OpenThoughts-Agent-v1-RL

Viewer • Updated Jan 27 • 728 • 370 • 12
MMR1/MMR1-RL

Viewer • Updated Oct 1, 2025 • 15k • 168 • 1
zwhe99/DeepMath-103K

Viewer • Updated May 29, 2025 • 103k • 6.34k • 356