Quick start
ollama run nemotronAvailable sizes
| Tag | Size | Quantization | Context | Min RAM |
|---|---|---|---|---|
| nemotron:latest | 43GB | q4_k_m | 128K context | 53.8 GB |
Run with
Claude Code
ollama launch claude --model nemotronCodex
ollama launch codex --model nemotronOpenCode
ollama launch opencode --model nemotronOpenClaw
ollama launch openclaw --model nemotronStrengths & Limitations
Strengths
- Helpful responses
- LLM customization
- NVIDIA optimized
Related models
gemma3General
The current, most capable model that runs on a single GPU.
32.1M pullsllama3General
Meta Llama 3: The most capable openly available LLM to date
16.1M pullsgpt-ossGeneral
OpenAI’s open-weight models designed for powerful reasoning, agentic tasks, and versatile developer use cases.
7.1M pullsdolphin3General
Dolphin 3.0 Llama 3.1 8B 🐬 is the next generation of the Dolphin series of instruct-tuned models designed to be the ultimate general purpose local model, enabling coding, math, agentic, function calling, and general use cases.
3.6M pulls