Reward Models 10-2025 - a nvidia Collection

nvidia 's Collections

NVIDIA Nemotron v3

Nemotron-Labs-Diffusion

Inference Optimized Checkpoints (with Model Optimizer)

Nemotron-Labs-Elastic

swe-zero-to-swe-hero

Nemotron Supervised Fine-Tuning

Nemotron Vision-Language

Nemotron Agentic & Tool-Use

Nemotron Safety & Content Moderation

Nemotron Chat & Instruction Following

Nemotron Code & SWE

Nemotron Math & Reasoning

Nemotron Reward Modeling

Nemotron Reinforcement Learning

Nemotron-Cascade 2

BioNeMo - Design

MedTech Open Models

Nemotron-Terminal

Nemotron Speech

Speculative Decoding Modules

Nemotron OCR and Object Detection

Nemotron ColEmbed V2

Steering Reasoning VLAs

NVIDIA Cosmos 2

Nemotron-Cascade

Nemotron-Post-Training-v3

Nemotron-Pre-Training-Datasets

NVIDIA Nemotron V2

Cosmos-Drive-Dreams

Reward Models 10-2025

BioNeMo - Understand

BioNeMo - Optimize

Cosmos-Predict2.5

Nemotron-Personas

Llama-Embed-Nemotron-8B

Reasoning Efficiency Research

OpenReasoning-Nemotron

Cosmos-Predict2

Reward Models 06-2025

Cosmos-Transfer2.5

Describe Anything

OpenMathReasoning

OpenCodeReasoning

OpenCodeReasoning-II

Llama Nemotron Feedback-Edit Inference-Time Scaling

Scoring Verifiers

Nemotron-UltraLong

Cosmos-Transfer1

Cosmos-Tokenize1

Cosmos-Predict1

Cosmos-Tokenizer

Llama-3.1-Nemotron-70B

NVILA-Speech-Audio-Setups

NeMo Audio Codecs

Optimized ONNX models for NVIDIA RTX GPUs

Nemotron 4 340B

Llama3-ChatQA-1.5

PS3: Scaling Vision Pre-Training to 4K Resolution

Llama3-ChatQA-2

NeMo Curator - Classifier Models

Nemotron v3 Pre-Training

Reward Models 10-2025

updated about 10 hours ago

A collection of great reward models for research and production