Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
1
1
Bolian Li
lblaoke
Follow
AmberYifan's profile picture
1 follower
·
1 following
https://lblaoke.github.io/
lblaoke
lblaoke
bolian-li-554001297
AI & ML interests
None yet
Recent Activity
liked
a dataset
about 2 months ago
princeton-nlp/llama3-ultrafeedback-armorm
upvoted
a
paper
3 months ago
Hybrid Reinforcement: When Reward Is Sparse, It's Better to Be Dense
updated
a collection
7 months ago
Preference Data
View all activity
Organizations
None yet
lblaoke
's models
44
Sort: Recently updated
lblaoke/mistral-v0.1-7b-ppo-self
7B
•
Updated
Feb 4, 2025
•
3
lblaoke/mistral-v0.1-7b-ppo-human
7B
•
Updated
Feb 4, 2025
•
2
lblaoke/llama2-7b-ppo-self-human
7B
•
Updated
Feb 3, 2025
•
5
lblaoke/llama2-7b-ppo-self
7B
•
Updated
Feb 3, 2025
•
5
lblaoke/llama2-7b-ppo-human
7B
•
Updated
Feb 3, 2025
•
3
lblaoke/mistral-v0.3-7b-rm-human
Text Classification
•
7B
•
Updated
Jan 14, 2025
•
6
lblaoke/mistral-v0.3-7b-rm-self-human
Text Classification
•
7B
•
Updated
Jan 14, 2025
•
7
lblaoke/mistral-v0.3-7b-rm-self
Text Classification
•
7B
•
Updated
Jan 14, 2025
•
3
lblaoke/mistral-v0.1-7b-rm-self-human
Text Classification
•
7B
•
Updated
Jan 14, 2025
•
7
lblaoke/mistral-v0.1-7b-rm-self
Text Classification
•
7B
•
Updated
Jan 14, 2025
•
7
lblaoke/llama2-7b-rm-self
Text Classification
•
7B
•
Updated
Jan 14, 2025
•
4
lblaoke/mistral-v0.1-7b-rm-human
Text Classification
•
7B
•
Updated
Jan 14, 2025
•
6
lblaoke/llama2-7b-rm-human
Text Classification
•
7B
•
Updated
Jan 14, 2025
•
6
lblaoke/llama2-7b-rm-self-human
Text Classification
•
7B
•
Updated
Jan 13, 2025
•
8
Previous
1
2
Next