Skip to main content
Ollama ExplorerBeta
GeneraladvancedTools

nemotron

NVIDIALlama

Llama-3.1-Nemotron-70B-Instruct is a large language model customized by NVIDIA to improve the helpfulness of LLM generated responses to user queries.

265K pullsUpdated Feb 26, 202517 tags128K context

Quick start

ollama run nemotron

Available sizes

TagSizeQuantizationContextMin RAM
nemotron:latest43GBq4_k_m128K context53.8 GB

Run with

Claude Code
ollama launch claude --model nemotron
Codex
ollama launch codex --model nemotron
OpenCode
ollama launch opencode --model nemotron
OpenClaw
ollama launch openclaw --model nemotron

Strengths & Limitations

Strengths

  • Helpful responses
  • LLM customization
  • NVIDIA optimized

Related models