Trilogix1/Hugston_code-rl-Qwen3-4B-Instruct-2507-SFT-30b pipeline_tag: text-generation tags:

Qwen3 Instruct

Coder 4B

Hugston


Original weights at: https://huggingface.co/code-rl/Qwen3-4B-Instruct-2507-SFT-30b

This model is converted and quantized version by Hugston Team created with Quanta (see Github to know more about it). This is a real, proof-of-concept and implementation on how to convert and quantize a .safetensor llm model in GGUF.

Screenshot 2025-11-21 114116

Quantization was performed using an automatic and faster method, which leads to less time and faster results.

This model was made possible by: https://Hugston.com

You can use the model with HugstonOne Enterprise Edition

Tested in general and coding tasks. Loaded with 262000 tokens ctx and feed with 150kb code as input, and gave back 230kb code output or ~ 60000 tokens at once. The code had 5 errors and certainly is not a 0-shot in long coding. It is working with 2-3 tries, which makes it very impressive for it´s size and considering being an instruct model.

Screenshot 2025-11-26 145749


Watch HugstonOne coding and preview in action:

https://vimeo.com/1121493834?share=copy&fl=sv&fe=ci

-Download App HugstonOne at Hugston.com or at https://github.com/Mainframework

-Download model from https://hugston.com/explore?folder=llm_models or Huggingface

-If you already have the Llm Model downloaded chose it by clicking pick model in HugstonOne -Then click Load model in Cli or Server


-For multimodal use you need a VL/multimodal LLM model with the Mmproj file in the same folder. -Select model and select mmproj.


-Note: if the mmproj is inside the same folder with other models non multimodal, the non model will not load unless the mmproj is moved from folder.

Downloads last month
346
GGUF
Model size
4B params
Architecture
qwen3
Hardware compatibility
Log In to view the estimation

4-bit

5-bit

6-bit

8-bit

16-bit

32-bit

Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support