Active filters: vLLM
mistralai/Mistral-Medium-3.5-128B • 128B • Updated • 444k • 366
mistralai/Mistral-Small-4-119B-2603 • 119B • Updated • 73.5k • 396
mistralai/Mistral-Small-4-119B-2603-NVFP4 • Updated • 1.41k • 100
Image-Text-to-Text • 10B • Updated • 559k • 22
mistralai/Mistral-Medium-3.5-128B-EAGLE • Updated • 279 • 48
unsloth/Mistral-Small-4-119B-2603-GGUF • 119B • Updated • 8.98k • 73
QuantTrio/Qwen3-235B-A22B-Instruct-2507-AWQ • Text Generation • 235B • Updated • 13.5k • 11
QuantTrio/Qwen3.5-122B-A10B-AWQ • Image-Text-to-Text • 125B • Updated • 15.5k • 28
mistralai/Mistral-Small-4-119B-2603-eagle • Updated • 296 • 52
bartowski/mistralai_Mistral-Small-4-119B-2603-GGUF • Image-Text-to-Text • 119B • Updated • 2.49k • 12
mradermacher/Mistral-Small-4-119B-2603-GGUF • 119B • Updated • 265 • 1
mradermacher/Mistral-Small-4-119B-2603-i1-GGUF • 119B • Updated • 4.46k • 3
QuantTrio/Qwen3.6-27B-AWQ • Image-Text-to-Text • 28B • Updated • 524k • 15
model-scope/glm-4-9b-chat-GPTQ-Int4 • Text Generation • 9B • Updated • 64 • 6
model-scope/glm-4-9b-chat-GPTQ-Int8 • Text Generation • 9B • Updated • 5 • 2
tclf90/qwen2.5-72b-instruct-gptq-int4 • Text Generation • 73B • Updated • 100 • 2
tclf90/qwen2.5-72b-instruct-gptq-int3 • Text Generation • 69B • Updated • 89
prithivMLmods/Nu2-Lupi-Qwen-14B • Text Generation • 15B • Updated • 7 • 2
mradermacher/Nu2-Lupi-Qwen-14B-GGUF • 15B • Updated • 96 • 1
mradermacher/Nu2-Lupi-Qwen-14B-i1-GGUF • 15B • Updated • 261 • 1
JunHowie/Qwen3-0.6B-GPTQ-Int4 • Text Generation • 0.6B • Updated • 230 • 1
JunHowie/Qwen3-0.6B-GPTQ-Int8 • Text Generation • 0.6B • Updated • 8
JunHowie/Qwen3-1.7B-GPTQ-Int4 • Text Generation • 2B • Updated • 193 • 1
JunHowie/Qwen3-1.7B-GPTQ-Int8 • Text Generation • 2B • Updated • 10
JunHowie/Qwen3-32B-GPTQ-Int4 • Text Generation • 33B • Updated • 1.83k • 4
JunHowie/Qwen3-32B-GPTQ-Int8 • Text Generation • 33B • Updated • 153 • 4
JunHowie/Qwen3-30B-A3B-GPTQ-Int4 • Text Generation • 5B • Updated • 20 • 1
JunHowie/Qwen3-14B-GPTQ-Int8 • Text Generation • 15B • Updated • 43 • 1
JunHowie/Qwen3-14B-GPTQ-Int4 • Text Generation • 15B • Updated • 37.6k • 4
JunHowie/Qwen3-8B-GPTQ-Int8 • Text Generation • 8B • Updated • 2.54k