Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Log In
Sign Up
1
3
Mehul Damani
PRO
mehuldamani
Follow
wjurayj's profile picture
John6666's profile picture
Spechawk's profile picture
3 followers
·
0 following
https://damanimehul.github.io
MehulDamani2
damanimehul
AI & ML interests
Reinforcement Learning, Large Language Models
Recent Activity
updated
a model
1 day ago
mehuldamani/bug_fixing_sft-v1
published
a model
1 day ago
mehuldamani/bug_fixing_sft-v1
updated
a model
3 days ago
mehuldamani/code_gen_arl-ast-addmultiply-7b-v1
View all activity
Organizations
None yet
mehuldamani
's models
256
Sort: Recently updated
mehuldamani/bug_fixing_sft-v1
Text Generation
•
8B
•
Updated
1 day ago
•
239
mehuldamani/code_gen_arl-ast-addmultiply-7b-v1
Text Generation
•
8B
•
Updated
3 days ago
•
323
mehuldamani/code_gen_rlvr-ast-7b-v2
Text Generation
•
8B
•
Updated
3 days ago
•
235
mehuldamani/bug_fixing_arl-7b-addmultiply-v4
Text Generation
•
8B
•
Updated
3 days ago
•
238
mehuldamani/bug_fixing_rlvr-7b-v4
Text Generation
•
8B
•
Updated
3 days ago
•
250
mehuldamani/sft-corrupted-qwen-v3
Text Generation
•
3B
•
Updated
15 days ago
•
1.2k
mehuldamani/sft-corrupted-qwen-v2
Text Generation
•
3B
•
Updated
16 days ago
•
380
mehuldamani/sft-corrupted-qwen-v1
Text Generation
•
3B
•
Updated
16 days ago
•
546
mehuldamani/rlvr-qwen-hmaze-v1
Text Generation
•
3B
•
Updated
17 days ago
•
313
mehuldamani/rlm-qwen-hmaze-v1-high-fifo
Text Generation
•
3B
•
Updated
17 days ago
•
306
mehuldamani/hmaze-oracle-v1-multiply
Text Generation
•
3B
•
Updated
17 days ago
•
290
mehuldamani/hmaze-oracle-v1
Text Generation
•
3B
•
Updated
17 days ago
•
294
mehuldamani/sft-qwen-hmaze-v2
Text Generation
•
3B
•
Updated
18 days ago
•
752
mehuldamani/sft-qwen-hmaze-v1
Text Generation
•
3B
•
Updated
18 days ago
•
416
mehuldamani/sft-qwen-zmaze-v3
Text Generation
•
3B
•
Updated
19 days ago
•
214
mehuldamani/sft-qwen-zmaze-v2
Text Generation
•
3B
•
Updated
20 days ago
•
678
mehuldamani/sft-qwen-zmaze-v1
Text Generation
•
3B
•
Updated
20 days ago
•
446
mehuldamani/sft-qwen-vmaze-v1
Text Generation
•
3B
•
Updated
22 days ago
•
1.07k
mehuldamani/rlvr_multi_k3
Updated
22 days ago
mehuldamani/rlvr_single
Updated
22 days ago
mehuldamani/qwen3_8b_medical_rlcr_multi_k_4
Updated
23 days ago
mehuldamani/qwen3_8b_medical_rlcr_multi_k_2
Updated
23 days ago
mehuldamani/qwen3_8b_medical_rlcr_multi_k_5
Updated
23 days ago
mehuldamani/sft-qwen-maze-v2
Text Generation
•
8B
•
Updated
24 days ago
•
120
mehuldamani/sft-qwen-maze-v1
Text Generation
•
8B
•
Updated
25 days ago
•
746
mehuldamani/sft-maze-v2
Text Generation
•
8B
•
Updated
25 days ago
•
442
mehuldamani/partial-sft-story-v6
Text Generation
•
8B
•
Updated
27 days ago
•
186
mehuldamani/instruct-story-v6
Text Generation
•
8B
•
Updated
27 days ago
•
244
•
1
mehuldamani/sft-new-story-v4
Text Generation
•
8B
•
Updated
29 days ago
•
547
mehuldamani/sft-mini-story
Text Generation
•
8B
•
Updated
Mar 16
•
60
Previous
1
2
3
...
9
Next