NVIDIA-Nemotron-Nano-12B-v2 GGUF

Recommended way to run this model:

llama-server -hf danbev/NVIDIA-Nemotron-Nano-12B-v2-GGUF -c 0 -fa

Then, access http://localhost:8080

Downloads last month
15
GGUF
Model size
12B params
Architecture
nemotron_h
Hardware compatibility
Log In to view the estimation

8-bit

Inference Providers NEW
This model isn't deployed by any Inference Provider. ๐Ÿ™‹ Ask for provider support