DeepSeek R1 Distill Qwen 1.5B

deepseek-ai · released 2025-01-20 · MIT license

DeepSeek R1 Distill Qwen 1.5B by deepseek-ai is a 1.78B-parameter open-weight local LLM released under the MIT license. This page lists benchmarks, VRAM requirements per quantization, GPU compatibility, pricing and community reviews on slopsome.com.

Key specs

Type: Local open-weight
Parameters: 1.78B total
Architecture: qwen2
Context window: 131K tokens
Knowledge cutoff:
Modalities: text
Recommended backends:
Minimum viable rig:

Benchmark scores

GPQA Diamond
SWE-bench Verified
AIME
MMLU-Pro
BFCL v3 (tool use)
Composite score
Community rating: No reviews yet

VRAM & disk per quantization

Quant    VRAM     Disk   RAM   Context
Q4_K_M   2.5 GB   1 GB         131K
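
As a rough sanity check on the Q4_K_M row, the size of the quantized weights can be estimated from the parameter count alone. The sketch below is not how slopsome.com computes its figures. It assumes an average of about 4.85 bits per weight for Q4_K_M (an approximation, since the exact value varies by model) and uses a hypothetical helper named `quantized_weight_size_gb`. It covers weights only and ignores the KV cache and runtime overhead, which account for the gap between disk size and VRAM.

```python
def quantized_weight_size_gb(n_params: float, bits_per_weight: float) -> float:
    """Approximate size of the quantized weights in decimal gigabytes."""
    total_bytes = n_params * bits_per_weight / 8
    return total_bytes / 1e9


N_PARAMS = 1.78e9    # parameter count from the spec list above
Q4_K_M_BITS = 4.85   # ASSUMPTION: approximate average bits per weight for Q4_K_M

size_gb = quantized_weight_size_gb(N_PARAMS, Q4_K_M_BITS)
print(f"Estimated Q4_K_M weights: {size_gb:.2f} GB")
```

Under that assumption the estimate comes out slightly above 1 GB, which is in line with the roughly 1 GB listed in the table.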