NVIDIA-Nemotron-Nano-9B-v2 is a large language model (LLM) trained from scratch by NVIDIA, and designed as a unified model for both reasoning and non-reasoning tasks. It responds to user queries and tasks by first generating a reasoning trace and then concluding with a final response.
The model's reasoning capabilities can be controlled via a system prompt. If the user prefers the model to provide its final answer without intermediate reasoning traces, it can be configured to do so.
Modalities
In / Out Price
Low
$0.04 / $0.16per 1M
Context
Avg
131K
Released
Sep 5, 2025
Knowledge Cutoff
Mar 2025
Going away June 11, 2026