Bielik-11B-v3.0-Instruct MLX (4-bit)

MLX 4-bit quantized version of speakleash/Bielik-11B-v3.0-Instruct for Apple Silicon.

Model Details

Property	Value
Original Model	speakleash/Bielik-11B-v3.0-Instruct
Format	MLX 4-bit
Size	~5.9 GB
Peak Memory	~6.4 GB
Generation Speed	~28 tok/s (M3 Ultra)

Other Quantizations

Variant	Size	Memory	Link
bf16	22 GB	22.4 GB	LibraxisAI/Bielik-11B-v3.0-mlx-bf16
q8	11 GB	11.9 GB	LibraxisAI/Bielik-11B-v3.0-mlx-q8
q5	7.2 GB	7.8 GB	LibraxisAI/Bielik-11B-v3.0-mlx-q5
mxfp4	5.5 GB	6.0 GB	LibraxisAI/Bielik-11B-v3.0-mlx-mxfp4
q4 (this)	5.9 GB	6.4 GB	-

Usage

pip install mlx-lm

mlx_lm.generate --model LibraxisAI/Bielik-11B-v3.0-mlx-q4 --prompt "Cześć, jak się masz?"
mlx_lm.chat --model LibraxisAI/Bielik-11B-v3.0-mlx-q4

from mlx_lm import load, generate

model, tokenizer = load("LibraxisAI/Bielik-11B-v3.0-mlx-q4")
response = generate(model, tokenizer, prompt="Wyjaśnij czym jest sztuczna inteligencja.", max_tokens=256)
print(response)

About Bielik

Bielik is a Polish language model developed by SpeakLeash. This conversion enables native execution on Apple Silicon Macs using the MLX framework.

License

Apache 2.0 - see original model for terms.

Converted by LibraxisAI using mlx-lm

Downloads last month: 36

Safetensors

Model size

11B params

Tensor type

BF16

U32

MLX

Hardware compatibility

4-bit

Model tree for LibraxisAI/Bielik-11B-v3.0-mlx-q4

Base model

speakleash/Bielik-11B-v3-Base-20250730

Finetuned

speakleash/Bielik-11B-v3.0-Instruct

Quantized

(15)

this model