CapRL-Qwen3VL-4B-GGUF

CapRL-Qwen3VL-4B from internlm is a 4B-parameter vision-language model from the CapRL 2.0 series, fine-tuned from Qwen3-VL-4B using an upgraded Reinforcement Learning with Verifiable Rewards (RLVR) two-stage pipelineโ€”LVLMs generate rich captions followed by vision-only LLM QA evaluation on a rigorously filtered, diverse image datasetโ€”significantly outperforming CapRL-Qwen2.5VL-3B and Qwen2.5-VL-72B in captioning tasks while offering high performance and advanced abilities for charts, infographics, documents, and natural images with structured, hallucination-minimal outputs. As the top model in the CapRL series (vs. 2B for speed/efficiency), it delivers remarkable visual understanding, comprehensive information coverage, and well-organized descriptions via vLLM OpenAI-compatible API (gpu_memory_utilization=0.95), supporting base64-encoded images for precise text extraction and detailed captioning without SFT memorization issues. Part of ongoing advancements with QA curation code and GGUF quantizations, it's ideal for research, annotation, and production deployment balancing compute cost and superior perception.

CapRL-Qwen3VL-4B [GGUF]

File Name Quant Type File Size File Link
CapRL-Qwen3VL-4B.IQ4_XS.gguf IQ4_XS 2.49 GB Download
CapRL-Qwen3VL-4B.Q2_K.gguf Q2_K 1.8 GB Download
CapRL-Qwen3VL-4B.Q3_K_L.gguf Q3_K_L 2.41 GB Download
CapRL-Qwen3VL-4B.Q3_K_M.gguf Q3_K_M 2.24 GB Download
CapRL-Qwen3VL-4B.Q3_K_S.gguf Q3_K_S 2.05 GB Download
CapRL-Qwen3VL-4B.Q4_K_M.gguf Q4_K_M 2.72 GB Download
CapRL-Qwen3VL-4B.Q4_K_S.gguf Q4_K_S 2.6 GB Download
CapRL-Qwen3VL-4B.Q5_K_M.gguf Q5_K_M 3.16 GB Download
CapRL-Qwen3VL-4B.Q5_K_S.gguf Q5_K_S 3.09 GB Download
CapRL-Qwen3VL-4B.Q6_K.gguf Q6_K 3.63 GB Download
CapRL-Qwen3VL-4B.Q8_0.gguf Q8_0 4.69 GB Download
CapRL-Qwen3VL-4B.f16.gguf F16 8.83 GB Download
CapRL-Qwen3VL-4B.i1-IQ1_M.gguf i1-IQ1_M 1.25 GB Download
CapRL-Qwen3VL-4B.i1-IQ1_S.gguf i1-IQ1_S 1.18 GB Download
CapRL-Qwen3VL-4B.i1-IQ2_M.gguf i1-IQ2_M 1.68 GB Download
CapRL-Qwen3VL-4B.i1-IQ2_S.gguf i1-IQ2_S 1.58 GB Download
CapRL-Qwen3VL-4B.i1-IQ2_XS.gguf i1-IQ2_XS 1.48 GB Download
CapRL-Qwen3VL-4B.i1-IQ2_XXS.gguf i1-IQ2_XXS 1.37 GB Download
CapRL-Qwen3VL-4B.i1-IQ3_M.gguf i1-IQ3_M 2.13 GB Download
CapRL-Qwen3VL-4B.i1-IQ3_S.gguf i1-IQ3_S 2.07 GB Download
CapRL-Qwen3VL-4B.i1-IQ3_XS.gguf i1-IQ3_XS 1.98 GB Download
CapRL-Qwen3VL-4B.i1-IQ3_XXS.gguf i1-IQ3_XXS 1.84 GB Download
CapRL-Qwen3VL-4B.i1-IQ4_NL.gguf i1-IQ4_NL 2.6 GB Download
CapRL-Qwen3VL-4B.i1-IQ4_XS.gguf i1-IQ4_XS 2.48 GB Download
CapRL-Qwen3VL-4B.i1-Q2_K.gguf i1-Q2_K 1.8 GB Download
CapRL-Qwen3VL-4B.i1-Q2_K_S.gguf i1-Q2_K_S 1.69 GB Download
CapRL-Qwen3VL-4B.i1-Q3_K_L.gguf i1-Q3_K_L 2.41 GB Download
CapRL-Qwen3VL-4B.i1-Q3_K_M.gguf i1-Q3_K_M 2.24 GB Download
CapRL-Qwen3VL-4B.i1-Q3_K_S.gguf i1-Q3_K_S 2.05 GB Download
CapRL-Qwen3VL-4B.i1-Q4_0.gguf i1-Q4_0 2.59 GB Download
CapRL-Qwen3VL-4B.i1-Q4_1.gguf i1-Q4_1 2.84 GB Download
CapRL-Qwen3VL-4B.i1-Q4_K_M.gguf i1-Q4_K_M 2.72 GB Download
CapRL-Qwen3VL-4B.i1-Q4_K_S.gguf i1-Q4_K_S 2.6 GB Download
CapRL-Qwen3VL-4B.i1-Q5_K_M.gguf i1-Q5_K_M 3.16 GB Download
CapRL-Qwen3VL-4B.i1-Q5_K_S.gguf i1-Q5_K_S 3.09 GB Download
CapRL-Qwen3VL-4B.i1-Q6_K.gguf i1-Q6_K 3.63 GB Download
CapRL-Qwen3VL-4B.imatrix.gguf imatrix 3.87 MB Download
CapRL-Qwen3VL-4B.mmproj-Q8_0.gguf mmproj-Q8_0 454 MB Download
CapRL-Qwen3VL-4B.mmproj-f16.gguf mmproj-f16 836 MB Download

Quants Usage

(sorted by size, not necessarily quality. IQ-quants are often preferable over similar sized non-IQ quants)

Here is a handy graph by ikawrakow comparing some lower-quality quant types (lower is better):

image.png

Downloads last month
-
GGUF
Model size
4B params
Architecture
qwen3vl
Hardware compatibility
Log In to view the estimation

1-bit

2-bit

3-bit

4-bit

5-bit

6-bit

8-bit

16-bit

Inference Providers NEW
This model isn't deployed by any Inference Provider. ๐Ÿ™‹ Ask for provider support

Model tree for prithivMLmods/CapRL-Qwen3VL-4B-GGUF

Quantized
(4)
this model