Qwen3 235B A22B by Qwen — 235.09B MoE (22B active) open-weight local LLM. License: apache-2.0. Benchmarks, VRAM requirements per quantization, GPU compatibility, pricing and community reviews on slopsome.com.
Key specs
| Type | Local open-weight |
|---|
| Parameters | 235.09B total · MoE, 22B active |
|---|
| Architecture | qwen3_moe |
|---|
| Context window | 41K tokens |
|---|
| Knowledge cutoff | 2025-02-01 |
|---|
| Modalities | text |
|---|
| Recommended backends | — |
|---|
| Minimum viable rig | — |
|---|
Benchmark scores
| GPQA Diamond | 71% |
|---|
| SWE-bench Verified | 62% |
|---|
| AIME | 85% |
|---|
| MMLU-Pro | 83% |
|---|
| BFCL v3 (tool use) | 73% |
|---|
| Composite score | 6.88 |
|---|
| Community rating | No reviews yet |
|---|
VRAM & disk per quantization
| Quant | VRAM | Disk | RAM | Context |
|---|
| Q8 | 240 GB | 235 GB | 256 GB | 131K |
| Q4_K_M | 137.9 GB | 136.4 GB | 160 GB | 41K |
API pricing (per 1M tokens)
| Provider | Input | Output | Free tier |
|---|
| Together AI | $0.2 | $0.6 | No |
| Fireworks AI | $0.22 | $0.88 | No |
| SiliconFlow | $0.35 | $1.4 | No |