GLM 4.7 Flash by zai-org — 31.22B open-weight local LLM. License: mit. Benchmarks, VRAM requirements per quantization, GPU compatibility, pricing and community reviews on slopsome.com.
Key specs
| Type | Local open-weight |
|---|
| Parameters | 31.22B total · MoE, — active |
|---|
| Architecture | glm4_moe_lite |
|---|
| Context window | 203K tokens |
|---|
| Knowledge cutoff | — |
|---|
| Modalities | text |
|---|
| Recommended backends | — |
|---|
| Minimum viable rig | — |
|---|
Benchmark scores
| GPQA Diamond | — |
|---|
| SWE-bench Verified | — |
|---|
| AIME | — |
|---|
| MMLU-Pro | — |
|---|
| BFCL v3 (tool use) | — |
|---|
| Composite score | — |
|---|
| Community rating | No reviews yet |
|---|
VRAM & disk per quantization
| Quant | VRAM | Disk | RAM | Context |
|---|
| Q4_K_M | 19.6 GB | 18.1 GB | — | 203K |