CapRL-Qwen3VL-2B-GGUF

CapRL-Qwen3VL-2B from internlm is a 2B-parameter vision-language model from the CapRL 2.0 series, fine-tuned from Qwen3-VL-2B using a decoupled two-stage Reinforcement Learning with Verifiable Rewards (RLVR) paradigmโ€”initial LVLMs generate rich captions followed by vision-only LLM QA evaluation on a rigorously filtered 75K datasetโ€”outperforming both CapRL-Qwen2.5VL-3B and the much larger Qwen2.5-VL-72B in image captioning tasks while prioritizing speed and efficiency for charts, infographics, documents, and natural images with well-structured, hallucination-minimal outputs. It excels in dense visual understanding with comprehensive coverage of valid information, making it ideal for lightweight deployment via vLLM OpenAI-compatible API (gpu_memory_utilization=0.95, tensor-parallel-size=1) on standard hardware, supporting base64-encoded images for precise text extraction and description without traditional SFT memorization limitations. Part of an ongoing series (including CapRL-Qwen3VL-4B for advanced needs), it leverages upgraded training recipes with diverse image datasets and QA curation code for research, annotation, and production captioning, with GGUF quantizations available.

CapRL-Qwen3VL-2B [GGUF]

File Name Quant Type File Size File Link
CapRL-Qwen3VL-2B.IQ4_XS.gguf IQ4_XS 1.18 GB Download
CapRL-Qwen3VL-2B.Q2_K.gguf Q2_K 880 MB Download
CapRL-Qwen3VL-2B.Q3_K_L.gguf Q3_K_L 1.14 GB Download
CapRL-Qwen3VL-2B.Q3_K_M.gguf Q3_K_M 1.07 GB Download
CapRL-Qwen3VL-2B.Q3_K_S.gguf Q3_K_S 1 GB Download
CapRL-Qwen3VL-2B.Q4_K_M.gguf Q4_K_M 1.28 GB Download
CapRL-Qwen3VL-2B.Q4_K_S.gguf Q4_K_S 1.24 GB Download
CapRL-Qwen3VL-2B.Q5_K_M.gguf Q5_K_M 1.47 GB Download
CapRL-Qwen3VL-2B.Q5_K_S.gguf Q5_K_S 1.44 GB Download
CapRL-Qwen3VL-2B.Q6_K.gguf Q6_K 1.67 GB Download
CapRL-Qwen3VL-2B.Q8_0.gguf Q8_0 2.17 GB Download
CapRL-Qwen3VL-2B.f16.gguf F16 4.07 GB Download
CapRL-Qwen3VL-2B.i1-IQ1_M.gguf i1-IQ1_M 646 MB Download
CapRL-Qwen3VL-2B.i1-IQ1_S.gguf i1-IQ1_S 618 MB Download
CapRL-Qwen3VL-2B.i1-IQ2_M.gguf i1-IQ2_M 829 MB Download
CapRL-Qwen3VL-2B.i1-IQ2_S.gguf i1-IQ2_S 792 MB Download
CapRL-Qwen3VL-2B.i1-IQ2_XS.gguf i1-IQ2_XS 734 MB Download
CapRL-Qwen3VL-2B.i1-IQ2_XXS.gguf i1-IQ2_XXS 693 MB Download
CapRL-Qwen3VL-2B.i1-IQ3_M.gguf i1-IQ3_M 1.03 GB Download
CapRL-Qwen3VL-2B.i1-IQ3_S.gguf i1-IQ3_S 1 GB Download
CapRL-Qwen3VL-2B.i1-IQ3_XS.gguf i1-IQ3_XS 968 MB Download
CapRL-Qwen3VL-2B.i1-IQ3_XXS.gguf i1-IQ3_XXS 888 MB Download
CapRL-Qwen3VL-2B.i1-IQ4_NL.gguf i1-IQ4_NL 1.23 GB Download
CapRL-Qwen3VL-2B.i1-IQ4_XS.gguf i1-IQ4_XS 1.18 GB Download
CapRL-Qwen3VL-2B.i1-Q2_K.gguf i1-Q2_K 880 MB Download
CapRL-Qwen3VL-2B.i1-Q2_K_S.gguf i1-Q2_K_S 835 MB Download
CapRL-Qwen3VL-2B.i1-Q3_K_L.gguf i1-Q3_K_L 1.14 GB Download
CapRL-Qwen3VL-2B.i1-Q3_K_M.gguf i1-Q3_K_M 1.07 GB Download
CapRL-Qwen3VL-2B.i1-Q3_K_S.gguf i1-Q3_K_S 1 GB Download
CapRL-Qwen3VL-2B.i1-Q4_0.gguf i1-Q4_0 1.23 GB Download
CapRL-Qwen3VL-2B.i1-Q4_1.gguf i1-Q4_1 1.34 GB Download
CapRL-Qwen3VL-2B.i1-Q4_K_M.gguf i1-Q4_K_M 1.28 GB Download
CapRL-Qwen3VL-2B.i1-Q4_K_S.gguf i1-Q4_K_S 1.24 GB Download
CapRL-Qwen3VL-2B.i1-Q5_K_M.gguf i1-Q5_K_M 1.47 GB Download
CapRL-Qwen3VL-2B.i1-Q5_K_S.gguf i1-Q5_K_S 1.44 GB Download
CapRL-Qwen3VL-2B.i1-Q6_K.gguf i1-Q6_K 1.67 GB Download
CapRL-Qwen3VL-2B.imatrix.gguf imatrix 2.09 MB Download
CapRL-Qwen3VL-2B.mmproj-Q8_0.gguf mmproj-Q8_0 445 MB Download
CapRL-Qwen3VL-2B.mmproj-f16.gguf mmproj-f16 819 MB Download

Quants Usage

(sorted by size, not necessarily quality. IQ-quants are often preferable over similar sized non-IQ quants)

Here is a handy graph by ikawrakow comparing some lower-quality quant types (lower is better):

image.png

Downloads last month
177
GGUF
Model size
2B params
Architecture
qwen3vl
Hardware compatibility
Log In to view the estimation

1-bit

2-bit

3-bit

4-bit

5-bit

6-bit

8-bit

16-bit

Inference Providers NEW
This model isn't deployed by any Inference Provider. ๐Ÿ™‹ Ask for provider support

Model tree for prithivMLmods/CapRL-Qwen3VL-2B-GGUF

Quantized
(5)
this model