Active filters: GPTQ
JunHowie/Qwen3-1.7B-GPTQ-Int4
Text Generation
• 2B • Updated • 250
• 1
JunHowie/Qwen3-1.7B-GPTQ-Int8
Text Generation
• 2B • Updated • 22
JunHowie/Qwen3-32B-GPTQ-Int4
Text Generation
• 33B • Updated • 17.7k
• 4
JunHowie/Qwen3-32B-GPTQ-Int8
Text Generation
• 33B • Updated • 457
• 4
JunHowie/Qwen3-30B-A3B-GPTQ-Int4
Text Generation
• 5B • Updated • 18
• 1
JunHowie/Qwen3-14B-GPTQ-Int8
Text Generation
• 15B • Updated • 57
• 1
JunHowie/Qwen3-14B-GPTQ-Int4
Text Generation
• 15B • Updated • 3.46k
• 4
JunHowie/Qwen3-8B-GPTQ-Int8
Text Generation
• 8B • Updated • 1.6k
JunHowie/Qwen3-8B-GPTQ-Int4
Text Generation
• 8B • Updated • 243
• 4
JunHowie/Qwen3-4B-GPTQ-Int4
Text Generation
• 4B • Updated • 1.84k
• 1
JunHowie/Qwen3-4B-GPTQ-Int8
Text Generation
• 4B • Updated • 29
JunHowie/Qwen3-30B-A3B-GPTQ-Int8
Text Generation
• 8B • Updated • 85
iqbalamo93/Phi-4-mini-instruct-GPTQ-4bit
Text Generation
• 4B • Updated • 147
iqbalamo93/Phi-4-mini-instruct-GPTQ-8bit
Text Generation
• 4B • Updated • 13
• 2
GusPuffy/Legion-V2.1-LLaMa-70B-GPTQ
Text Generation
• 71B • Updated • 1
QuantTrio/DeepSeek-R1-0528-Qwen3-8B-GPTQ-Int4-Int8Mix
Text Generation
• 11B • Updated • 9
• 4
RedHatAI/DeepSeek-R1-0528-quantized.w4a16
Text Generation
• 676B • Updated • 234
• 13
QuantTrio/DeepSeek-R1-0528-GPTQ-Int4-Int8Mix-Lite
Text Generation
• 721B • Updated • 12
• 2
QuantTrio/DeepSeek-R1-0528-GPTQ-Int4-Int8Mix-Compact
Text Generation
• 847B • Updated • 16
• 5
AXERA-TECH/Qwen2.5-0.5B-Instruct-CTX-Int8
QuantTrio/DeepSeek-R1-0528-GPTQ-Int4-Int8Mix-Medium
Text Generation
• 912B • Updated • 9
• 1
kxdw2580/DeepSeek-R1-0528-Qwen3-8B-catgirl-v2.5-gptqv2-8bit
Text Generation
• 8B • Updated • 2
kxdw2580/DeepSeek-R1-0528-Qwen3-8B-catgirl-v2.5-gptqv2-4bit
Text Generation
• 8B • Updated • 4
dengcao/GLM-4.1V-9B-Thinking-GPTQ-Int4-Int8Mix
Image-Text-to-Text
• 15B • Updated • 13
• 2
RedHatAI/Kimi-K2-Instruct-quantized.w4a16
Text Generation
• 1T • Updated • 303
• 12
GusPuffy/BlackSheep-24B-GPTQ
Text Generation
• 24B • Updated • 3
QuantTrio/Qwen3-235B-A22B-Instruct-2507-GPTQ-Int4-Int8Mix
Text Generation
• 248B • Updated • 117
• 4
QuantTrio/GLM-4.1V-9B-Thinking-GPTQ-Int4-Int8Mix
Text Generation
• 15B • Updated • 6
• 1
QuantTrio/Qwen3-Coder-480B-A35B-Instruct-GPTQ-Int4-Int8Mix
Text Generation
• 534B • Updated • 90
• 7
QuantTrio/Qwen3-235B-A22B-Thinking-2507-GPTQ-Int4-Int8Mix
Text Generation
• 253B • Updated • 25
• 4