CapRL-Qwen3VL-2B-GGUF

CapRL-Qwen3VL-2B from internlm is a 2B-parameter vision-language model from the CapRL 2.0 series, fine-tuned from Qwen3-VL-2B using a decoupled two-stage Reinforcement Learning with Verifiable Rewards (RLVR) paradigm—initial LVLMs generate rich captions followed by vision-only LLM QA evaluation on a rigorously filtered 75K dataset—outperforming both CapRL-Qwen2.5VL-3B and the much larger Qwen2.5-VL-72B in image captioning tasks while prioritizing speed and efficiency for charts, infographics, documents, and natural images with well-structured, hallucination-minimal outputs. It excels in dense visual understanding with comprehensive coverage of valid information, making it ideal for lightweight deployment via vLLM OpenAI-compatible API (gpu_memory_utilization=0.95, tensor-parallel-size=1) on standard hardware, supporting base64-encoded images for precise text extraction and description without traditional SFT memorization limitations. Part of an ongoing series (including CapRL-Qwen3VL-4B for advanced needs), it leverages upgraded training recipes with diverse image datasets and QA curation code for research, annotation, and production captioning, with GGUF quantizations available.

CapRL-Qwen3VL-2B [GGUF]

File Name	Quant Type	File Size	File Link
CapRL-Qwen3VL-2B.IQ4_XS.gguf	IQ4_XS	1.18 GB	Download
CapRL-Qwen3VL-2B.Q2_K.gguf	Q2_K	880 MB	Download
CapRL-Qwen3VL-2B.Q3_K_L.gguf	Q3_K_L	1.14 GB	Download
CapRL-Qwen3VL-2B.Q3_K_M.gguf	Q3_K_M	1.07 GB	Download
CapRL-Qwen3VL-2B.Q3_K_S.gguf	Q3_K_S	1 GB	Download
CapRL-Qwen3VL-2B.Q4_K_M.gguf	Q4_K_M	1.28 GB	Download
CapRL-Qwen3VL-2B.Q4_K_S.gguf	Q4_K_S	1.24 GB	Download
CapRL-Qwen3VL-2B.Q5_K_M.gguf	Q5_K_M	1.47 GB	Download
CapRL-Qwen3VL-2B.Q5_K_S.gguf	Q5_K_S	1.44 GB	Download
CapRL-Qwen3VL-2B.Q6_K.gguf	Q6_K	1.67 GB	Download
CapRL-Qwen3VL-2B.Q8_0.gguf	Q8_0	2.17 GB	Download
CapRL-Qwen3VL-2B.f16.gguf	F16	4.07 GB	Download
CapRL-Qwen3VL-2B.i1-IQ1_M.gguf	i1-IQ1_M	646 MB	Download
CapRL-Qwen3VL-2B.i1-IQ1_S.gguf	i1-IQ1_S	618 MB	Download
CapRL-Qwen3VL-2B.i1-IQ2_M.gguf	i1-IQ2_M	829 MB	Download
CapRL-Qwen3VL-2B.i1-IQ2_S.gguf	i1-IQ2_S	792 MB	Download
CapRL-Qwen3VL-2B.i1-IQ2_XS.gguf	i1-IQ2_XS	734 MB	Download
CapRL-Qwen3VL-2B.i1-IQ2_XXS.gguf	i1-IQ2_XXS	693 MB	Download
CapRL-Qwen3VL-2B.i1-IQ3_M.gguf	i1-IQ3_M	1.03 GB	Download
CapRL-Qwen3VL-2B.i1-IQ3_S.gguf	i1-IQ3_S	1 GB	Download
CapRL-Qwen3VL-2B.i1-IQ3_XS.gguf	i1-IQ3_XS	968 MB	Download
CapRL-Qwen3VL-2B.i1-IQ3_XXS.gguf	i1-IQ3_XXS	888 MB	Download
CapRL-Qwen3VL-2B.i1-IQ4_NL.gguf	i1-IQ4_NL	1.23 GB	Download
CapRL-Qwen3VL-2B.i1-IQ4_XS.gguf	i1-IQ4_XS	1.18 GB	Download
CapRL-Qwen3VL-2B.i1-Q2_K.gguf	i1-Q2_K	880 MB	Download
CapRL-Qwen3VL-2B.i1-Q2_K_S.gguf	i1-Q2_K_S	835 MB	Download
CapRL-Qwen3VL-2B.i1-Q3_K_L.gguf	i1-Q3_K_L	1.14 GB	Download
CapRL-Qwen3VL-2B.i1-Q3_K_M.gguf	i1-Q3_K_M	1.07 GB	Download
CapRL-Qwen3VL-2B.i1-Q3_K_S.gguf	i1-Q3_K_S	1 GB	Download
CapRL-Qwen3VL-2B.i1-Q4_0.gguf	i1-Q4_0	1.23 GB	Download
CapRL-Qwen3VL-2B.i1-Q4_1.gguf	i1-Q4_1	1.34 GB	Download
CapRL-Qwen3VL-2B.i1-Q4_K_M.gguf	i1-Q4_K_M	1.28 GB	Download
CapRL-Qwen3VL-2B.i1-Q4_K_S.gguf	i1-Q4_K_S	1.24 GB	Download
CapRL-Qwen3VL-2B.i1-Q5_K_M.gguf	i1-Q5_K_M	1.47 GB	Download
CapRL-Qwen3VL-2B.i1-Q5_K_S.gguf	i1-Q5_K_S	1.44 GB	Download
CapRL-Qwen3VL-2B.i1-Q6_K.gguf	i1-Q6_K	1.67 GB	Download
CapRL-Qwen3VL-2B.imatrix.gguf	imatrix	2.09 MB	Download
CapRL-Qwen3VL-2B.mmproj-Q8_0.gguf	mmproj-Q8_0	445 MB	Download
CapRL-Qwen3VL-2B.mmproj-f16.gguf	mmproj-f16	819 MB	Download

Quants Usage

(sorted by size, not necessarily quality. IQ-quants are often preferable over similar sized non-IQ quants)

Here is a handy graph by ikawrakow comparing some lower-quality quant types (lower is better):

Downloads last month: 51

GGUF

Model size

2B params

Architecture

qwen3vl

Hardware compatibility

1-bit

2-bit

3-bit

4-bit

5-bit

6-bit

8-bit

16-bit

View +1 variant

Model tree for prithivMLmods/CapRL-Qwen3VL-2B-GGUF

Base model

internlm/CapRL-Qwen3VL-2B

Quantized

(5)

this model