Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
alphaXiv 's Collections
Agent-R1
Reproducing-TRM

Agent-R1

updated 2 days ago
Upvote
-

  • alphaXiv/Qwen-2.5-1.5b-instruct-ppo

    2B • Updated 2 days ago • 29

  • alphaXiv/Qwen-2.5-1.5b-instruct-grpo

    2B • Updated 2 days ago • 17
Upvote
-
  • Collection guide
  • Browse collections
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs