Skip to main content
Ollama ExplorerBeta
Languageintermediate

llama3-chatqa

NVIDIALlama

A model from NVIDIA based on Llama 3 that excels at conversational question answering (QA) and retrieval-augmented generation (RAG).

405K pullsUpdated Feb 26, 202535 tags8K context

Quick start

ollama run llama3-chatqa

Available sizes

TagSizeQuantizationContextMin RAM
llama3-chatqa:latest4.7GBq4_k_m8K context5.9 GB
llama3-chatqa:70b40GBq4_k_m8K context50 GB

Strengths & Limitations

Strengths

  • Conversational QA
  • Retrieval-augmented generation
  • Based on Llama 3

Related models