Meta Llama 3.1 70B Instruct Quantized.W4a16

RedHatAI · released 2024-07-31 · llama3.1 license

Meta Llama 3.1 70B Instruct Quantized.W4a16 by RedHatAI — 70.55B open-weight local LLM. License: llama3.1. Benchmarks, VRAM requirements per quantization, GPU compatibility, pricing and community reviews on slopsome.com.

Key specs

TypeLocal open-weight
Parameters70.55B total
Architecturellama
Context window131K tokens
Knowledge cutoff
Modalitiestext
Recommended backends
Minimum viable rig

Benchmark scores

GPQA Diamond
SWE-bench Verified
AIME
MMLU-Pro
BFCL v3 (tool use)
Composite score
Community ratingNo reviews yet

VRAM & disk per quantization

QuantVRAMDiskRAMContext
Q4_K_M42.4 GB40.9 GB131K