Trilogix1/Hugston_code-rl-Qwen3-4B-Instruct-2507-SFT-30b pipeline_tag: text-generation tags:
Qwen3 Instruct
Coder 4B
Hugston
Original weights at: https://huggingface.co/code-rl/Qwen3-4B-Instruct-2507-SFT-30b
This model is converted and quantized version by Hugston Team created with Quanta (see Github to know more about it). This is a real, proof-of-concept and implementation on how to convert and quantize a .safetensor llm model in GGUF.
Quantization was performed using an automatic and faster method, which leads to less time and faster results.
This model was made possible by: https://Hugston.com
You can use the model with HugstonOne Enterprise Edition
Tested in general and coding tasks. Loaded with 262000 tokens ctx and feed with 150kb code as input, and gave back 230kb code output or ~ 60000 tokens at once. The code had 5 errors and certainly is not a 0-shot in long coding. It is working with 2-3 tries, which makes it very impressive for it´s size and considering being an instruct model.
Watch HugstonOne coding and preview in action:
https://vimeo.com/1121493834?share=copy&fl=sv&fe=ci
-Download App HugstonOne at Hugston.com or at https://github.com/Mainframework
-Download model from https://hugston.com/explore?folder=llm_models or Huggingface
-If you already have the Llm Model downloaded chose it by clicking pick model in HugstonOne -Then click Load model in Cli or Server
-For multimodal use you need a VL/multimodal LLM model with the Mmproj file in the same folder. -Select model and select mmproj.
-Note: if the mmproj is inside the same folder with other models non multimodal, the non model will not load unless the mmproj is moved from folder.
- Downloads last month
- 346
4-bit
5-bit
6-bit
8-bit
16-bit
32-bit

