README: talk about gguf
README.md CHANGED

@@ -87,6 +87,17 @@ outputs = model.generate(**input_ids)
 print(tokenizer.decode(outputs[0]))
 ```
 
+### Running the model with llama.cpp
+
+We converted the Dragoman PT adapter into the [GGUF format](https://huggingface.co/lang-uk/dragoman/blob/main/ggml-adapter-model.bin).
+
+You can download the [Mistral-7B-v0.1 base model in the GGUF format](https://huggingface.co/TheBloke/Mistral-7B-v0.1-GGUF) (e.g. `mistral-7b-v0.1.Q4_K_M.gguf`)
+and use `ggml-adapter-model.bin` from this repository like this:
+
+```
+./main -ngl 32 -m mistral-7b-v0.1.Q4_K_M.gguf --color -c 4096 --temp 0 --repeat_penalty 1.1 -n -1 -p "[INST] who holds this neighborhood [/INST]" --lora ./ggml-adapter-model.bin
+```
+
 ### Training Dataset and Resources
 
 Training code: [lang-uk/dragoman](https://github.com/lang-uk/dragoman)
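The `-p` argument in the llama.cpp command passes the prompt in Mistral's `[INST] … [/INST]` instruction format. As a minimal sketch, building such prompts programmatically might look like this (the `make_prompt` helper is hypothetical, not part of the repository):

```python
def make_prompt(instruction: str) -> str:
    # Wrap the user instruction in Mistral's [INST] ... [/INST] markers,
    # matching the prompt string passed via -p in the llama.cpp command above.
    return f"[INST] {instruction} [/INST]"

print(make_prompt("who holds this neighborhood"))
```

Running this prints `[INST] who holds this neighborhood [/INST]`, the exact prompt used in the example command above.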