Update README.md

You can download the [Mistral-7B-v0.1 base model in the GGUF format](https://hug
and use `ggml-adapter-model.bin` from this repository like this:

```
./main -ngl 32 -m mistral-7b-v0.1.Q4_K_M.gguf --color -c 4096 --temp 0 --repeat_penalty 1.1 -n -1 -p "[INST] who holds this neighborhood? [/INST]" --lora ./ggml-adapter-model.bin
```
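
Here `-ngl 32` offloads 32 layers to the GPU, `-c 4096` sets the context window, `--temp 0` makes decoding greedy, and `--lora` applies the adapter from this repository on top of the base model; adjust the GPU offload to your hardware.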
### Running the model with mlx-lm

We merged the Dragoman PT adapter into the base model and uploaded the quantized version of the model to https://huggingface.co/lang-uk/dragoman-4bit.
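
A merge of this kind can be reproduced with `peft` and then quantized for MLX with `mlx_lm.convert`; the snippet below is a minimal sketch under those assumptions (model ids and paths are illustrative), not a verbatim copy of our pipeline:

```python
# Sketch: merge the adapter into the base weights, then quantize for mlx-lm.
from transformers import AutoModelForCausalLM, AutoTokenizer
from peft import PeftModel

base = AutoModelForCausalLM.from_pretrained("mistralai/Mistral-7B-v0.1")
# "lang-uk/dragoman" is used here as an illustrative adapter id.
merged = PeftModel.from_pretrained(base, "lang-uk/dragoman").merge_and_unload()
merged.save_pretrained("dragoman-merged")
AutoTokenizer.from_pretrained("mistralai/Mistral-7B-v0.1").save_pretrained("dragoman-merged")

# Then quantize the merged checkpoint to 4 bits for mlx-lm:
#   python -m mlx_lm.convert --hf-path dragoman-merged -q
```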
You can run the model using [mlx-lm](https://pypi.org/project/mlx-lm/):

```
python -m mlx_lm.generate --model lang-uk/dragoman-4bit --prompt '[INST] who holds this neighborhood? [/INST]' --temp 0 --max-tokens 100
```
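
The same generation is available from Python; below is a minimal sketch assuming mlx-lm's `load`/`generate` API (exact keyword arguments vary between mlx-lm versions):

```python
# Sketch: run the quantized model through the mlx-lm Python API.
from mlx_lm import load, generate

model, tokenizer = load("lang-uk/dragoman-4bit")
output = generate(
    model,
    tokenizer,
    prompt="[INST] who holds this neighborhood? [/INST]",
    max_tokens=100,  # mirrors --max-tokens 100 in the CLI call above
)
print(output)
```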
### Training Dataset and Resources