Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Buckets new
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
pittawat 's Collections
RL Training Sets
Medical Experiments
Med Models

RL Training Sets

updated Feb 14
Upvote
-

  • POLARIS-Project/Polaris-Dataset-53K

    Viewer • Updated Jun 18, 2025 • 53.3k • 801 • 34

    Note 53K Math Dataset - AIME - AMC - Omni-MATH - STILL - AReal-Boba


  • agentica-org/DeepCoder-Preview-Dataset

    Viewer • Updated Apr 9, 2025 • 25k • 2.89k • 99

    Note 25K Code Dataset (train splits) - LCBv5 - PrimeIntellect’s SYNTHETIC-1 - TACO Verified


  • miromind-ai/MiroRL-GenQA

    Viewer • Updated Aug 11, 2025 • 13.1k • 91 • 12

    Note 12.3K Synthesized QA Dataset (train split) - by GPT-4.1


  • allenai/Dolci-Think-RL-7B

    Viewer • Updated Jan 5 • 102k • 2.04k • 16

    Note 102K Curated Dataset - Precise IF - Math - Coding - General Chat


  • nvidia/Nemotron-3-Nano-RL-Training-Blend

    Preview • Updated Dec 15, 2025 • 547 • 21

  • open-thoughts/OpenThoughts-Agent-v1-RL

    Viewer • Updated Jan 27 • 728 • 370 • 12

  • MMR1/MMR1-RL

    Viewer • Updated Oct 1, 2025 • 15k • 168 • 1

  • zwhe99/DeepMath-103K

    Viewer • Updated May 29, 2025 • 103k • 6.34k • 356
Upvote
-
  • Collection guide
  • Browse collections
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs