Gemma 2 27B

Google · Gemma 2 family · released 2024-06-01 · Gemma license

A strong, efficient 2024 dense model — great on one GPU, but its 8k context is small by today's standards.

Key specs

TypeLocal open-weight
Parameters27.2B total
ArchitectureDense decoder-only with interleaved local-global attention (4k sliding window), GQA, logit soft-capping
Context window8K tokens
Knowledge cutoff
Modalitiestext
Recommended backends
Minimum viable rig~16GB VRAM at 4-bit (~54GB BF16)

Benchmark scores

GPQA Diamond
SWE-bench Verified
AIME
MMLU-Pro
BFCL v3 (tool use)
Composite score
Community ratingNo reviews yet

VRAM & disk per quantization

QuantVRAMDiskRAMContext
Q4_K_M17 GB16 GB28 GB8K

Strengths & weaknesses

Strengths: Strong size-class quality; beat Llama 3 70B on LMSYS Arena Elo at release; Runs comfortably on a single GPU; Broad tooling support

Weaknesses: Short 8k native context; English-only output; No reasoning mode or vision (Gemma 3+ features)