Bolian Li's picture

1 1

Bolian Li

lblaoke

·

https://lblaoke.github.io/

AI & ML interests

None yet

Recent Activity

liked a dataset about 2 months ago

princeton-nlp/llama3-ultrafeedback-armorm

upvoted a paper 3 months ago

Hybrid Reinforcement: When Reward Is Sparse, It's Better to Be Dense

updated a collection 7 months ago

Preference Data

View all activity

Organizations

None yet

lblaoke 's models 44

lblaoke/mistral-v0.1-7b-ppo-self

7B • Updated Feb 4, 2025 • 3

lblaoke/mistral-v0.1-7b-ppo-human

7B • Updated Feb 4, 2025 • 2

lblaoke/llama2-7b-ppo-self-human

7B • Updated Feb 3, 2025 • 5

lblaoke/llama2-7b-ppo-self

7B • Updated Feb 3, 2025 • 5

lblaoke/llama2-7b-ppo-human

7B • Updated Feb 3, 2025 • 3

lblaoke/mistral-v0.3-7b-rm-human

Text Classification • 7B • Updated Jan 14, 2025 • 6

lblaoke/mistral-v0.3-7b-rm-self-human

Text Classification • 7B • Updated Jan 14, 2025 • 7

lblaoke/mistral-v0.3-7b-rm-self

Text Classification • 7B • Updated Jan 14, 2025 • 3

lblaoke/mistral-v0.1-7b-rm-self-human

Text Classification • 7B • Updated Jan 14, 2025 • 7

lblaoke/mistral-v0.1-7b-rm-self

Text Classification • 7B • Updated Jan 14, 2025 • 7

lblaoke/llama2-7b-rm-self

Text Classification • 7B • Updated Jan 14, 2025 • 4

lblaoke/mistral-v0.1-7b-rm-human

Text Classification • 7B • Updated Jan 14, 2025 • 6

lblaoke/llama2-7b-rm-human

Text Classification • 7B • Updated Jan 14, 2025 • 6

lblaoke/llama2-7b-rm-self-human

Text Classification • 7B • Updated Jan 13, 2025 • 8