-
-
-
-
-
-
Inference Providers
Active filters:
8-bit
mlx-community/Qwen2.5-14B-Instruct-1M-8bit
Text Generation
•
4B
•
Updated
•
95
•
9
MaziyarPanahi/Mistral-Small-24B-Instruct-2501-GGUF
Text Generation
•
24B
•
Updated
•
152k
•
8
MaziyarPanahi/Captain-Eris_Violet_Toxic-Magnum-12B-GGUF
Text Generation
•
12B
•
Updated
•
120
•
4
driaforall/Tiny-Agent-a-3B-Q8-mlx
0.9B
•
Updated
•
15
•
4
driaforall/Tiny-Agent-a-1.5B-Q8-mlx
0.4B
•
Updated
•
8
•
3
driaforall/Tiny-Agent-a-0.5B-Q8-mlx
0.1B
•
Updated
•
4
•
3
Text Generation
•
397B
•
Updated
•
19.2k
•
271
Text Generation
•
0.5B
•
Updated
•
111
•
10
nvidia/Llama-4-Scout-17B-16E-Instruct-NVFP4
56B
•
Updated
•
20.2k
•
20
Text Generation
•
0.9B
•
Updated
•
61
•
13
tiiuae/Falcon-E-3B-Instruct
Text Generation
•
0.9B
•
Updated
•
443
•
37
MaziyarPanahi/Qwen3-30B-A3B-GGUF
Text Generation
•
31B
•
Updated
•
232k
•
4
Qwen/Qwen3-1.7B-GPTQ-Int8
Text Generation
•
2B
•
Updated
•
1.39k
•
7
Qwen/Qwen3-0.6B-GPTQ-Int8
Text Generation
•
0.6B
•
Updated
•
3.66k
•
8
Text Generation
•
0.2B
•
Updated
•
929
•
4
Text Generation
•
0.5B
•
Updated
•
837
•
3
Text Generation
•
2B
•
Updated
•
11.5k
•
8
Text Generation
•
4B
•
Updated
•
907
•
4
Text Generation
•
1B
•
Updated
•
873
•
3
nvidia/DeepSeek-R1-0528-NVFP4
Text Generation
•
397B
•
Updated
•
15.2k
•
41
Text Generation
•
9B
•
Updated
•
1k
•
11
Qwen/Qwen3-30B-A3B-MLX-8bit
Text Generation
•
8B
•
Updated
•
163
•
9
Qwen/Qwen3-235B-A22B-MLX-8bit
Text Generation
•
62B
•
Updated
•
186
•
9
nvidia/Qwen3-235B-A22B-NVFP4
Text Generation
•
133B
•
Updated
•
5.03k
•
14
mlx-community/LFM2-350M-8bit
Text Generation
•
99.7M
•
Updated
•
252
•
4
huizimao/gpt-oss-120b-uncensored-mxfp4
117B
•
Updated
•
372
•
6
driaforall/mem-agent-mlx-8bit
Text Generation
•
1B
•
Updated
•
11
•
2
shanjiaz/gpt-oss-120b-nvfp4-modelopt
59B
•
Updated
•
9.08k
•
2
EpistemeAI/Episteme-gptoss-20b-RL
Text Generation
•
22B
•
Updated
•
2
•
2
FabioSarracino/VibeVoice-Large-Q8
Text-to-Audio
•
9B
•
Updated
•
2.14k
•
83