Trilogix1/Hugston_code-rl-Qwen3-4B-Instruct-2507-SFT-30b

pipeline_tag: text-generation

tags: Qwen3 Instruct, Coder 4B, Hugston

Original weights at: https://huggingface.co/code-rl/Qwen3-4B-Instruct-2507-SFT-30b

This model is a converted and quantized version by the Hugston Team, created with Quanta (see GitHub for more about it). It is a working proof of concept showing how to convert and quantize a .safetensors LLM into GGUF.

Quantization was performed with an automated method, which reduces the time needed to produce the quantized files.
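As a quick sanity check on a converted file, the GGUF container starts with the 4-byte magic `GGUF` followed by a little-endian 32-bit version number. The sketch below only reads those first 8 bytes. The function name `read_gguf_header` is hypothetical and is not part of Quanta or HugstonOne; it is a minimal illustration, not the tooling used to produce this model.

```python
import struct

GGUF_MAGIC = b"GGUF"  # first four bytes of every GGUF file


def read_gguf_header(path):
    """Return the GGUF version number, or raise ValueError if the file is not GGUF.

    Hypothetical helper for illustration: it reads only the magic and the
    version field and does not parse tensors or metadata.
    """
    with open(path, "rb") as f:
        header = f.read(8)
    if len(header) < 8 or header[:4] != GGUF_MAGIC:
        raise ValueError(f"{path} does not start with the GGUF magic bytes")
    # The version is stored as an unsigned 32-bit little-endian integer.
    (version,) = struct.unpack("<I", header[4:8])
    return version
```

Calling `read_gguf_header("model.gguf")` on a downloaded file (the file name here is a placeholder) returns the container version, or raises if the download is truncated or is not a GGUF file.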

This model was made possible by: https://Hugston.com

You can use the model with HugstonOne Enterprise Edition

Tested on general and coding tasks. Loaded with a 262000-token context and fed 150 KB of code as input, it returned 230 KB of code (~60000 tokens) in a single response. The output contained 5 errors, so this is certainly not a zero-shot result for long coding tasks. It works within 2-3 tries, which is very impressive for its size, considering that it is an instruct model.
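The figures above can be turned into a rough context budget. The sketch below assumes 1 KB = 1000 bytes and that the input code has the same bytes-per-token ratio as the reported output. Both are assumptions, since the real count depends on the tokenizer and on the code itself.

```python
# Figures reported in the test above.
CONTEXT_TOKENS = 262_000
INPUT_BYTES = 150_000    # 150 KB of input code (assuming 1 KB = 1000 bytes)
OUTPUT_BYTES = 230_000   # 230 KB of generated code
OUTPUT_TOKENS = 60_000   # approximate token count reported for the output


def bytes_per_token(num_bytes, num_tokens):
    """Average number of bytes represented by one token."""
    return num_bytes / num_tokens


def estimate_tokens(num_bytes, ratio):
    """Estimate a token count from a byte size and a bytes-per-token ratio."""
    return round(num_bytes / ratio)


ratio = bytes_per_token(OUTPUT_BYTES, OUTPUT_TOKENS)
# Assumption: the input tokenizes at the same ratio as the output.
estimated_input_tokens = estimate_tokens(INPUT_BYTES, ratio)
total_tokens = estimated_input_tokens + OUTPUT_TOKENS
fits_in_context = total_tokens <= CONTEXT_TOKENS

if __name__ == "__main__":
    print(f"bytes per token: {ratio:.2f}")
    print(f"estimated input tokens: {estimated_input_tokens}")
    print(f"total tokens: {total_tokens} (fits: {fits_in_context})")
```

Under these assumptions the input comes to roughly 39000 tokens, so prompt plus output use well under half of the loaded context.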

Watch HugstonOne coding and preview in action:

https://vimeo.com/1121493834?share=copy&fl=sv&fe=ci

-Download the HugstonOne app at Hugston.com or at https://github.com/Mainframework

-Download the model from https://hugston.com/explore?folder=llm_models or Hugging Face

-If you already have the LLM model downloaded, choose it by clicking Pick model in HugstonOne

-Then click Load model in CLI or Server

-For multimodal use you need a VL/multimodal LLM model with the mmproj file in the same folder

-Select the model and select the mmproj file

-Note: if the mmproj file is in the same folder as non-multimodal models, those models will not load until the mmproj file is moved out of the folder.
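The folder rule in the note above can be checked before loading. The sketch below is a heuristic only: it assumes the projector file has "mmproj" in its name and a .gguf extension, and it cannot tell which model is multimodal, so it only warns when a projector shares a folder with more than one model file. The function name `check_model_folder` is hypothetical and is not part of HugstonOne.

```python
from pathlib import Path


def check_model_folder(folder):
    """List model and mmproj files in a folder and flag a likely conflict.

    Assumption: projector files contain "mmproj" in their file name.
    """
    gguf_names = sorted(p.name for p in Path(folder).glob("*.gguf"))
    mmproj = [n for n in gguf_names if "mmproj" in n.lower()]
    models = [n for n in gguf_names if "mmproj" not in n.lower()]

    warnings = []
    # A projector next to several models suggests that at least one of them
    # is not its multimodal partner and may fail to load.
    if mmproj and len(models) > 1:
        warnings.append(
            "mmproj file shares a folder with multiple models; "
            "move it next to its multimodal model only"
        )
    return {"models": models, "mmproj": mmproj, "warnings": warnings}
```

Keeping each multimodal model and its mmproj file in a dedicated folder avoids the warning altogether.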

GGUF

Model size: 4B params

Architecture: qwen3

Hardware compatibility: 4-bit, 5-bit, 6-bit, 8-bit, 16-bit, 32-bit