license: mit
base_model: zai-org/GLM-4.5-Air
base_model_relation: quantized
quantized_by: turboderp
tags:
  - exl3

EXL3 quants of GLM-4.5-Air

2.00 bits per weight
2.25 bits per weight (optimized)
2.50 bits per weight (optimized)
3.00 bits per weight
3.07 bits per weight (optimized)
3.50 bits per weight (optimized)
4.00 bits per weight
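
As a rough guide to choosing among the variants above, the bits-per-weight figure can be turned into an approximate weight size. The sketch below is only an estimate: `estimate_weight_gib` is a hypothetical helper, and the parameter count in `ASSUMED_PARAMS` is an assumption that this README does not state (check the base model card for the real figure). It also ignores any tensors stored at a different precision, as well as KV cache and activation memory.

```python
def estimate_weight_gib(num_params: float, bits_per_weight: float) -> float:
    """Approximate size of the quantized weights in GiB.

    Treats every parameter as stored at the same average bitrate.
    """
    total_bits = num_params * bits_per_weight
    return total_bits / 8 / 2**30  # bits -> bytes -> GiB


# ASSUMPTION: total parameter count of the base model, not stated in this README.
ASSUMED_PARAMS = 106e9

if __name__ == "__main__":
    # Bitrates taken from the list of quants above.
    for bpw in (2.00, 2.25, 2.50, 3.00, 3.07, 3.50, 4.00):
        size = estimate_weight_gib(ASSUMED_PARAMS, bpw)
        print(f"{bpw:.2f} bpw -> ~{size:.1f} GiB of weights")
```

Actual VRAM use will be higher than these figures, since context length and cache settings add to the total.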

[Images: 2.00 bpw, 2.25 bpw, 2.50 bpw, 3.00 bpw, 3.07 bpw, 3.50 bpw, 4.00 bpw, API]