Edit Models filters

Model Tree

Apps

Docker Model Runner

Inference Providers

OVHcloud AI Endpoints

HF Inference API

Misc

Inference Endpoints

text-generation-inference

Eval Results (legacy)

text-embeddings-inference

4-bit precision

8-bit precision

Mixture of Experts

Carbon Emissions

Models

92

Base only

Active filters: math-reasoning

MMR1/MMR1-7B-SFT

Image-Text-to-Text • 8B • Updated Oct 1, 2025 • 10

mradermacher/MMR1-7B-SFT-GGUF

8B • Updated Oct 1, 2025 • 191

deepgo/Mobile-ReasoningLLM-v0

Text Generation • 2B • Updated Sep 30, 2025

mradermacher/Mobile-ReasoningLLM-v0-GGUF

2B • Updated Sep 30, 2025 • 33

Vantuk/Qwen3-1.7B-Countdown

Text Generation • 2B • Updated Oct 27, 2025 • 6

deepgo/Mobile-ReasoningLLM-v0.1

Text Generation • 2B • Updated Oct 29, 2025 • 1

mradermacher/Mobile-ReasoningLLM-v0.1-GGUF

2B • Updated Oct 30, 2025 • 42

mradermacher/Mobile-ReasoningLLM-v0.1-i1-GGUF

2B • Updated Dec 10, 2025 • 62

F-urkan/rStar2-Agent-14B-clone

Text Generation • 15B • Updated Oct 31, 2025 • 2

Sashank-810/llama3.1-8b-lft-lora

Text Generation • 8B • Updated Nov 25, 2025

AbstractPhil/math_collective_v1

Text Classification • Updated Dec 4, 2025

sairambokka/gemma3-1b-gsm8k-grpo-reasoning

Updated Dec 4, 2025

Harsha901/Qwen3-4B-Inst-Math-Reasoning-SFT

Text Generation • 4B • Updated Dec 16, 2025 • 2 •

Harsha901/Qwen3_4B_GRPO_GGUF

Text Generation • 4B • Updated Dec 23, 2025 • 87

real-jiakai/SmolLM3-3B-MathReason

Text Generation • 3B • Updated Jan 10 • 33

aaron1729/maslow-rl-gsm8k-gated

Reinforcement Learning • Updated Jan 10

deepgo/Mobile-Flash-v1-1.5B

Text Generation • Updated Feb 14

Shinegupta/ShineMath

Text Generation • Updated Feb 24 • 1 • 1

nbso/simple_pilot_project_model

Text Generation • Updated Feb 22

Asystemoffields/Cclilqwen

Text Generation • 0.8B • Updated Mar 11 •

mariklolik228/sus-qwen2.5-1.5b-grpo-lora

Updated Mar 23 • 6

mariklolik228/grpo-baseline-qwen2.5-1.5b-lora

Updated Mar 22 • 7

camilletyriard/gemma2-qlora-sft-grpo

Text Generation • Updated Apr 3

datasysdev/clsd

jaygala24/Qwen2.5-3B-GRPO-math-reasoning

Text Generation • 3B • Updated Apr 20 • 24 •

jaygala24/Qwen2.5-3B-GRPO-KL-math-reasoning

Text Generation • 3B • Updated Apr 20 • 22 •

jaygala24/Qwen3-1.7B-GRPO-math-reasoning

Text Generation • 2B • Updated Apr 20 • 162 •

jaygala24/Qwen3-1.7B-GRPO-KL-math-reasoning

Text Generation • 2B • Updated Apr 20 • 38 •

jaygala24/Qwen3-4B-GRPO-math-reasoning

Text Generation • 4B • Updated Apr 20 • 52 •

jaygala24/Qwen3-4B-GRPO-KL-math-reasoning

Text Generation • 4B • Updated Apr 20 • 145 •