Quick start
ollama run nemotron-3-nanoAvailable sizes
| Tag | Size | Quantization | Context | Min RAM |
|---|---|---|---|---|
| nemotron-3-nano:latest | 24GB | q4_k_m | 1M context | 30 GB |
Run with
Claude Code
ollama launch claude --model nemotron-3-nanoCodex
ollama launch codex --model nemotron-3-nanoOpenCode
ollama launch opencode --model nemotron-3-nanoOpenClaw
ollama launch openclaw --model nemotron-3-nanoStrengths & Limitations
Strengths
- Efficient performance
- Open-source availability
- Intelligent agentic capabilities
Benchmarks
| Benchmark | Score | Unit |
|---|---|---|
| MMLU-Pro | 78.3 | — |
| AIME25 | — | — |
| GPQA (no tools) | 73 | — |
| GPQA (with tools) | 75 | — |
| SWE-Bench (OpenHands) | 38.8 | — |
| MMLU-ProX (avg over langs) | 59.5 | — |
Related models
llama3.1Language
Llama 3.1 is a new state-of-the-art model from Meta available in 8B, 70B and 405B parameter sizes.
110.5M pullsllama3.2Language
Meta's Llama 3.2 goes small with 1B and 3B models.
58.0M pullsmistralLanguage
The 7B model released by Mistral AI, updated to version 0.3.
25.6M pullsqwen2.5Language
Qwen2.5 models are pretrained on Alibaba's latest large-scale dataset, encompassing up to 18 trillion tokens. The model supports up to 128K tokens and has multilingual support.
22.0M pulls