Collection of models and datasets for Beyond Binary Rewards: Training LMs to Reason about their Uncertainty
Mehul Damani PRO
mehuldamani
AI & ML interests
Reinforcement Learning, Large Language Models
Recent Activity
updated a model about 4 hours ago
mehuldamani/story_gen_extracted-story-v2
published a model about 4 hours ago
mehuldamani/story_gen_extracted-story-v2
updated a model about 5 hours ago
mehuldamani/story-llm-as-judge-binarized-v2
Organizations
None yet