Skip to main content
Ollama ExplorerBeta
MultimodaladvancedVisionTools

glm-ocr

Alibaba CloudOther

GLM-OCR is a multimodal OCR model for complex document understanding, built on the GLM-V encoder–decoder architecture.

46K pullsUpdated Feb 5, 20263 tags128K context

Quick start

ollama run glm-ocr

Available sizes

TagSizeQuantizationContextMin RAM
glm-ocr:q8_01.6GBq8_0128K context2 GB
glm-ocr:latest2.2GBq4_k_m128K context2.8 GB
glm-ocr:bf162.2GBbf16128K2.8 GB

Run with

Claude Code
ollama launch claude --model glm-ocr
Codex
ollama launch codex --model glm-ocr
OpenCode
ollama launch opencode --model glm-ocr
OpenClaw
ollama launch openclaw --model glm-ocr

Strengths & Limitations

Strengths

  • Complex document understanding
  • Multimodal OCR
  • GLM-V architecture

Related models