Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
ilgee
/
Binary-Think-RM-8B
like
0
Safetensors
English
llama
reward-model
RLHF
reasoning
preference-learning
arxiv:
2505.16265
License:
llama3.1
Model card
Files
Files and versions
xet
Community
main
Binary-Think-RM-8B
Commit History
Upload README.md with huggingface_hub
50a5793
verified
ilgee
commited on
Nov 2, 2025
Upload README.md with huggingface_hub
87aa611
verified
ilgee
commited on
Oct 23, 2025
Update model card
a9c8eae
verified
ilgee
commited on
Oct 12, 2025
Update model card
7d75a16
verified
ilgee
commited on
Oct 12, 2025
Upload README.md with huggingface_hub
3c64627
verified
ilgee
commited on
Oct 12, 2025
Upload model with updated chat template
e3b2ad3
verified
ilgee
commited on
May 8, 2025
initial commit
22cc553
verified
ilgee
commited on
May 8, 2025