CapRL-Qwen3VL-4B-GGUF
CapRL-Qwen3VL-4B from internlm is a 4B-parameter vision-language model from the CapRL 2.0 series, fine-tuned from Qwen3-VL-4B using an upgraded Reinforcement Learning with Verifiable Rewards (RLVR) two-stage pipelineโLVLMs generate rich captions followed by vision-only LLM QA evaluation on a rigorously filtered, diverse image datasetโsignificantly outperforming CapRL-Qwen2.5VL-3B and Qwen2.5-VL-72B in captioning tasks while offering high performance and advanced abilities for charts, infographics, documents, and natural images with structured, hallucination-minimal outputs. As the top model in the CapRL series (vs. 2B for speed/efficiency), it delivers remarkable visual understanding, comprehensive information coverage, and well-organized descriptions via vLLM OpenAI-compatible API (gpu_memory_utilization=0.95), supporting base64-encoded images for precise text extraction and detailed captioning without SFT memorization issues. Part of ongoing advancements with QA curation code and GGUF quantizations, it's ideal for research, annotation, and production deployment balancing compute cost and superior perception.
CapRL-Qwen3VL-4B [GGUF]
| File Name | Quant Type | File Size | File Link |
|---|---|---|---|
| CapRL-Qwen3VL-4B.IQ4_XS.gguf | IQ4_XS | 2.49 GB | Download |
| CapRL-Qwen3VL-4B.Q2_K.gguf | Q2_K | 1.8 GB | Download |
| CapRL-Qwen3VL-4B.Q3_K_L.gguf | Q3_K_L | 2.41 GB | Download |
| CapRL-Qwen3VL-4B.Q3_K_M.gguf | Q3_K_M | 2.24 GB | Download |
| CapRL-Qwen3VL-4B.Q3_K_S.gguf | Q3_K_S | 2.05 GB | Download |
| CapRL-Qwen3VL-4B.Q4_K_M.gguf | Q4_K_M | 2.72 GB | Download |
| CapRL-Qwen3VL-4B.Q4_K_S.gguf | Q4_K_S | 2.6 GB | Download |
| CapRL-Qwen3VL-4B.Q5_K_M.gguf | Q5_K_M | 3.16 GB | Download |
| CapRL-Qwen3VL-4B.Q5_K_S.gguf | Q5_K_S | 3.09 GB | Download |
| CapRL-Qwen3VL-4B.Q6_K.gguf | Q6_K | 3.63 GB | Download |
| CapRL-Qwen3VL-4B.Q8_0.gguf | Q8_0 | 4.69 GB | Download |
| CapRL-Qwen3VL-4B.f16.gguf | F16 | 8.83 GB | Download |
| CapRL-Qwen3VL-4B.i1-IQ1_M.gguf | i1-IQ1_M | 1.25 GB | Download |
| CapRL-Qwen3VL-4B.i1-IQ1_S.gguf | i1-IQ1_S | 1.18 GB | Download |
| CapRL-Qwen3VL-4B.i1-IQ2_M.gguf | i1-IQ2_M | 1.68 GB | Download |
| CapRL-Qwen3VL-4B.i1-IQ2_S.gguf | i1-IQ2_S | 1.58 GB | Download |
| CapRL-Qwen3VL-4B.i1-IQ2_XS.gguf | i1-IQ2_XS | 1.48 GB | Download |
| CapRL-Qwen3VL-4B.i1-IQ2_XXS.gguf | i1-IQ2_XXS | 1.37 GB | Download |
| CapRL-Qwen3VL-4B.i1-IQ3_M.gguf | i1-IQ3_M | 2.13 GB | Download |
| CapRL-Qwen3VL-4B.i1-IQ3_S.gguf | i1-IQ3_S | 2.07 GB | Download |
| CapRL-Qwen3VL-4B.i1-IQ3_XS.gguf | i1-IQ3_XS | 1.98 GB | Download |
| CapRL-Qwen3VL-4B.i1-IQ3_XXS.gguf | i1-IQ3_XXS | 1.84 GB | Download |
| CapRL-Qwen3VL-4B.i1-IQ4_NL.gguf | i1-IQ4_NL | 2.6 GB | Download |
| CapRL-Qwen3VL-4B.i1-IQ4_XS.gguf | i1-IQ4_XS | 2.48 GB | Download |
| CapRL-Qwen3VL-4B.i1-Q2_K.gguf | i1-Q2_K | 1.8 GB | Download |
| CapRL-Qwen3VL-4B.i1-Q2_K_S.gguf | i1-Q2_K_S | 1.69 GB | Download |
| CapRL-Qwen3VL-4B.i1-Q3_K_L.gguf | i1-Q3_K_L | 2.41 GB | Download |
| CapRL-Qwen3VL-4B.i1-Q3_K_M.gguf | i1-Q3_K_M | 2.24 GB | Download |
| CapRL-Qwen3VL-4B.i1-Q3_K_S.gguf | i1-Q3_K_S | 2.05 GB | Download |
| CapRL-Qwen3VL-4B.i1-Q4_0.gguf | i1-Q4_0 | 2.59 GB | Download |
| CapRL-Qwen3VL-4B.i1-Q4_1.gguf | i1-Q4_1 | 2.84 GB | Download |
| CapRL-Qwen3VL-4B.i1-Q4_K_M.gguf | i1-Q4_K_M | 2.72 GB | Download |
| CapRL-Qwen3VL-4B.i1-Q4_K_S.gguf | i1-Q4_K_S | 2.6 GB | Download |
| CapRL-Qwen3VL-4B.i1-Q5_K_M.gguf | i1-Q5_K_M | 3.16 GB | Download |
| CapRL-Qwen3VL-4B.i1-Q5_K_S.gguf | i1-Q5_K_S | 3.09 GB | Download |
| CapRL-Qwen3VL-4B.i1-Q6_K.gguf | i1-Q6_K | 3.63 GB | Download |
| CapRL-Qwen3VL-4B.imatrix.gguf | imatrix | 3.87 MB | Download |
| CapRL-Qwen3VL-4B.mmproj-Q8_0.gguf | mmproj-Q8_0 | 454 MB | Download |
| CapRL-Qwen3VL-4B.mmproj-f16.gguf | mmproj-f16 | 836 MB | Download |
Quants Usage
(sorted by size, not necessarily quality. IQ-quants are often preferable over similar sized non-IQ quants)
Here is a handy graph by ikawrakow comparing some lower-quality quant types (lower is better):
- Downloads last month
- -
Model tree for prithivMLmods/CapRL-Qwen3VL-4B-GGUF
Base model
internlm/CapRL-Qwen3VL-4B