LLaDA2.1 Flash

inclusionAI · released 2026-02-09 · apache-2.0 license

LLaDA2.1 Flash is a 102.89B-parameter open-weight LLM from inclusionAI for local deployment, released under the apache-2.0 license. Benchmarks, VRAM requirements per quantization, GPU compatibility, pricing, and community reviews are on slopsome.com.

Key specs

Type: Local open-weight
Parameters: 102.89B total · MoE, active parameter count not listed
Architecture: llada2_moe
Context window: 33K tokens
Knowledge cutoff: not listed
Modalities: text
Recommended backends: not listed
Minimum viable rig: not listed

Benchmark scores

GPQA Diamond: not listed
SWE-bench Verified: not listed
AIME: not listed
MMLU-Pro: not listed
BFCL v3 (tool use): not listed
Composite score: not listed
Community rating: No reviews yet

VRAM & disk per quantization

Quant    VRAM     Disk     RAM         Context
Q4_K_M   61.2 GB  59.7 GB  not listed  33K
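
As a rough sanity check on the Q4_K_M row, the sketch below derives the effective bits per weight from the file size and the gap between VRAM and disk. It reads the row as 61.2 GB of VRAM and 59.7 GB on disk, and it assumes that 1 GB means 10^9 bytes and that the file size is dominated by the 102.89B weights. The page states none of these assumptions, and the function names are illustrative only.

```python
# Figures taken from the spec card above.
PARAMS = 102.89e9   # total parameters
DISK_GB = 59.7      # Q4_K_M file size (assumed to be the "Disk" column)
VRAM_GB = 61.2      # Q4_K_M VRAM requirement (assumed to be the "VRAM" column)

# Assumption: the page uses decimal gigabytes (10**9 bytes), not GiB.
BYTES_PER_GB = 1e9


def bits_per_weight(file_gb: float, n_params: float,
                    bytes_per_gb: float = BYTES_PER_GB) -> float:
    """Average number of bits stored per parameter in the quantized file."""
    return file_gb * bytes_per_gb * 8 / n_params


def runtime_overhead_gb(vram_gb: float, file_gb: float) -> float:
    """VRAM needed beyond the weights themselves (KV cache, buffers, etc.)."""
    return vram_gb - file_gb


bpw = bits_per_weight(DISK_GB, PARAMS)
overhead = runtime_overhead_gb(VRAM_GB, DISK_GB)

print(f"effective bits per weight: {bpw:.2f}")
print(f"VRAM beyond weights: {overhead:.1f} GB")
```

Under these assumptions the file works out to roughly 4.64 bits per weight, with about 1.5 GB of VRAM on top of the weights at the listed 33K context. If the page instead reports GiB, the bits-per-weight figure would come out higher.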