Qwen3 4B Thinking 2507 by Qwen — 4.02B open-weight local LLM. License: apache-2.0. Benchmarks, VRAM requirements per quantization, GPU compatibility, pricing and community reviews on slopsome.com.
Key specs
| Type | Local open-weight |
|---|
| Parameters | 4.02B total |
|---|
| Architecture | qwen3 |
|---|
| Context window | 262K tokens |
|---|
| Knowledge cutoff | — |
|---|
| Modalities | text |
|---|
| Recommended backends | — |
|---|
| Minimum viable rig | — |
|---|
Benchmark scores
| GPQA Diamond | — |
|---|
| SWE-bench Verified | — |
|---|
| AIME | — |
|---|
| MMLU-Pro | — |
|---|
| BFCL v3 (tool use) | — |
|---|
| Composite score | — |
|---|
| Community rating | No reviews yet |
|---|
VRAM & disk per quantization
| Quant | VRAM | Disk | RAM | Context |
|---|
| Q4_K_M | 3.8 GB | 2.3 GB | — | 262K |