Quick start
ollama run granite3-moe

Available sizes
| Tag | Size | Quantization | Context | Min RAM |
|---|---|---|---|---|
| granite3-moe:3b | 2.1 GB | q4_k_m | 4K | 2.6 GB |
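
Once the model has been pulled with the quick-start command, it can also be queried from a script. The following is a minimal sketch, assuming a local Ollama server on its default address (`http://localhost:11434`) and its `/api/generate` endpoint. The helper names `build_payload` and `generate` are hypothetical and chosen for illustration only, and the `num_ctx` value of 4096 is an assumption that mirrors the 4K context listed in the table.

```python
import json
import urllib.request

# Assumption: the Ollama server is running locally on its default port.
OLLAMA_URL = "http://localhost:11434/api/generate"


def build_payload(prompt, model="granite3-moe", num_ctx=4096):
    """Build the JSON body for a non-streaming generate request (hypothetical helper)."""
    return {
        "model": model,
        "prompt": prompt,
        "stream": False,  # request a single JSON object rather than a token stream
        "options": {"num_ctx": num_ctx},  # assumed to match the 4K context in the table
    }


def generate(prompt, url=OLLAMA_URL, timeout=120):
    """Send the prompt to the server and return the generated text (hypothetical helper)."""
    body = json.dumps(build_payload(prompt)).encode("utf-8")
    request = urllib.request.Request(
        url, data=body, headers={"Content-Type": "application/json"}
    )
    with urllib.request.urlopen(request, timeout=timeout) as response:
        # A non-streaming reply carries the generated text in its "response" field.
        return json.loads(response.read())["response"]
```

With the server running, `generate("Summarize what a Mixture of Experts model is.")` should return the model's reply as a string. Only the standard library is used, so the sketch does not depend on any particular client package.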
Run with
Claude Code
ollama launch claude --model granite3-moe
Codex
ollama launch codex --model granite3-moe
OpenCode
ollama launch opencode --model granite3-moe
OpenClaw
ollama launch openclaw --model granite3-moe

Strengths & Limitations
Strengths
- Low latency
- Mixture of Experts (MoE) architecture
- Designed for efficient usage
Related models
llama3.1 (Language)
Llama 3.1 is a new state-of-the-art model from Meta available in 8B, 70B and 405B parameter sizes.
110.5M pulls

llama3.2 (Language)
Meta's Llama 3.2 goes small with 1B and 3B models.
58.0M pulls

mistral (Language)
The 7B model released by Mistral AI, updated to version 0.3.
25.6M pulls

qwen2.5 (Language)
Qwen2.5 models are pretrained on Alibaba's latest large-scale dataset, encompassing up to 18 trillion tokens. The models support a context of up to 128K tokens and multiple languages.
22.0M pulls