Quick start
ollama run nous-hermesAvailable sizes
| Tag | Size | Quantization | Context | Min RAM |
|---|---|---|---|---|
| nous-hermes:latest | 3.8GB | q4_k_m | 4K context | 4.8 GB |
| nous-hermes:13b | 7.4GB | q4_k_m | 4K context | 9.2 GB |
Strengths & Limitations
Strengths
- General purpose language tasks
- Based on Llama and Llama 2 architectures
- Good performance across a range of benchmarks
Related models
gemma3General
The current, most capable model that runs on a single GPU.
32.1M pullsllama3General
Meta Llama 3: The most capable openly available LLM to date
16.1M pullsgpt-ossGeneral
OpenAI’s open-weight models designed for powerful reasoning, agentic tasks, and versatile developer use cases.
7.1M pullsdolphin3General
Dolphin 3.0 Llama 3.1 8B 🐬 is the next generation of the Dolphin series of instruct-tuned models designed to be the ultimate general purpose local model, enabling coding, math, agentic, function calling, and general use cases.
3.6M pulls