← Back to directory
Meta · Released 2025-09

Llama 4

Meta's flagship dense LLM. Comes in 8B, 70B, and 405B; 8B fits on a single 16GB GPU and is the most-downloaded local model on Hugging Face.

LlamaCommercial with caveats👁 Visiongeneralvision
Params (max)
405B
Variants
405B / 70B / 8B
Context window
128K tokens
MMLU
87.1
HumanEval
84.5
GSM8K
93
Min VRAM (fp16, smallest variant)
16GB
Smallest Q4 GGUF
~5GB
Languages supported
12
Pros
  • Massive ecosystem
  • Strong instruction following
  • Fine-tunes everywhere
Cons
  • ×License restricts >700M MAU products
  • ×405B variant heavy for self-hosting

Highlights

  • 8B variant runs on a laptop
  • Vision input on 90B+
  • Most ecosystem support of any open LLM

Where to download

Hugging Face: meta-llama/Meta-Llama-4-405B
Or via Ollama (ollama pull llama-4) or LM Studio's in-app browser.

Related reading

Compare Llama 4 with