German Artificial Analytics
a PeerBench project

German LLM Overview

Which model is best on German tasks — one score, ranked. Filter by open vs. closed weights, price, or speed.

Weight
Price
Speed
#
Model
Score
Price
Speed
1
Gemini 3.1 Pro
Google · Closed
80.3
78–89
$6.75
$$$ · /Mtok
127
tok/s
2
Gemini 3.5 Flash
Google · Closed
68.4
67–75
$2.81
$$ · /Mtok
202.5
tok/s
3
Qwen3.7 Max
Alibaba · Closed
57.7
51–67
$1.28
$$ · /Mtok
170.3
tok/s
4
Gemini 3.1 Flash-Lite
Google · Closed
56.4
52–65
$0.44
$ · /Mtok
302.3
tok/s
5
DeepSeek V4 Pro
DeepSeek · Open weights
54.1
50–59
$1.62
$$ · /Mtok
57.8
tok/s
6
Gemma 4 31B
Google · Open weights
52.4
49–57
$0.17
$ · /Mtok
38.4
tok/s
7
DeepSeek V4 Flash
DeepSeek · Open weights
52.1
48–58
$0.14
$ · /Mtok
102.5
tok/s
8
MiMo V2.5 Pro
Xiaomi · Open weights
51.7
46–61
$0.85
$$ · /Mtok
49.2
tok/s
9
Gemini 2.5 Flash
Google · Closed
51.0
47–57
$1.00
$$ · /Mtok
216.2
tok/s
10
Qwen3.6 35B-A3B
Alibaba · Open weights
48.7
46–53
$0.30
$ · /Mtok
141.6
tok/s
11
Claude Haiku 4.5
Anthropic · Closed
46.4
44–49
$3.36
$$$ · /Mtok
140.6
tok/s
12
Gemma 4 26B A4B
Google · Open weights
45.0
40–51
$0.23
$ · /Mtok
84.4
tok/s
13
Gemma 4 12B
Google · Open weights
44.8
40–49
46.3
tok/s
14
GLM-5.1
Z.ai · Open weights
44.7
36–52
$1.45
$$ · /Mtok
71.4
tok/s
15
Tencent HY3-Preview
Tencent · Open weights
44.7
38–48
$0.08
$ · /Mtok
94.8
tok/s
16
Qwen3.5 9B
Alibaba · Open weights
39.1
34–42
$0.12
$ · /Mtok
57.7
tok/s
17
Qwen3 14B
Alibaba · Open weights
35.4
29–39
$0.14
$ · /Mtok
64.7
tok/s

Showing 17 models that ran ≥3 of 7 benchmarks (9 excluded for thin coverage). Price = median effective $/1M tokens; Speed = throughput + latency.