M1: Towards Scalable Test-Time Compute with Mamba Reasoning Models https://arxiv.org/abs/2504.10449
Junxiong Wang PRO
JunxiongWang
AI & ML interests
Attention Free Model / Subquadratic Language Models
Organizations
models 51
JunxiongWang/M1-3B
Text Generation • 3B • Updated • 9 • 2
JunxiongWang/M1-3B-SFT
Text Generation • 3B • Updated • 8 • 1
JunxiongWang/MambaInLlama1B_SFT_MATH
1B • Updated • 2
JunxiongWang/MambaInLlama3B_SFT_MATH
3B • Updated • 2
JunxiongWang/MambaInLlama3B_DPO2
3B • Updated • 5
JunxiongWang/MambaInLlama3B_DPO1
3B • Updated • 2
JunxiongWang/MambaInLlama3B_Distill_MATH
3B • Updated • 3
JunxiongWang/MambaInLlama3B_v3
3B • Updated • 1
JunxiongWang/MambaInLlama1B_Distill_MATH
1B • Updated • 3
JunxiongWang/mamba_0_5_distill
Updated • 2
datasets 20
JunxiongWang/QwenFineMATH
Viewer • Updated • 6.71M • 57
JunxiongWang/R1_GR_SFT
Viewer • Updated • 44k • 15
JunxiongWang/R1_SFT
Updated • 90
JunxiongWang/R1_Sythetic_SFT
Viewer • Updated • 1M • 215
JunxiongWang/MATH_SFT
Viewer • Updated • 19.1M • 93
JunxiongWang/R1_OpenThoughts_SFT
Viewer • Updated • 862k • 95
JunxiongWang/R1_am_SFT
Viewer • Updated • 1.4M • 328
JunxiongWang/qwen1b_it_math
Viewer • Updated • 19.1M • 89
JunxiongWang/test_math
Viewer • Updated • 89.1k • 108
JunxiongWang/FineMathV4
Viewer • Updated • 6.7M • 185