German Artificial Analytics
a PeerBench project

German LLM Benchmark · Model Profile

gpt-oss-120b

OpenAI · unverified · run 2026-05-29
66.2%

avg. German score

#26 of 30 models

5.9 pp below avg.
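
As a rough check on the headline figures, the sketch below recovers the average implied by the gap. It assumes "avg." means the mean German score across the 30 ranked models, which the page does not state explicitly; the variable names are illustrative only.

```python
# Figures copied from the profile above.
model_score_pct = 66.2  # avg. German score for gpt-oss-120b
gap_pp = 5.9            # percentage points below the average

# Assumption: "avg." is the mean score across all ranked models,
# so the implied average is the model score plus the gap.
implied_avg_pct = round(model_score_pct + gap_pp, 1)

print(f"Implied average: {implied_avg_pct}%")  # Implied average: 72.1%
```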

Benchmark breakdown

INCLUDE

66.2%

Native German exam and licensing questions covering region-specific knowledge — history, law, civics and culture. Written by humans in German, not translated.

4-option multiple choice · Native German
via CohereLabs/include-base-44

Cost & speed

$0.056 per 1,000 questions
1,641 tokens / second
0.31s time to first token
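
The cost and speed figures can be turned into per-run estimates with simple arithmetic. The sketch below is a back-of-the-envelope model, not the benchmark's own method: it assumes cost scales linearly with question count and that the throughput figure refers to output tokens generated after the first token arrives. The function names are hypothetical.

```python
# Figures copied from the "Cost & speed" section above.
COST_PER_1K_QUESTIONS_USD = 0.056
TOKENS_PER_SECOND = 1641
TIME_TO_FIRST_TOKEN_S = 0.31


def cost_for_questions(n_questions: int) -> float:
    """Estimated cost in USD, assuming cost is linear in question count."""
    return COST_PER_1K_QUESTIONS_USD * n_questions / 1000


def estimated_response_time(output_tokens: int) -> float:
    """Estimated seconds per response: time to first token plus
    generation time at the listed throughput (an assumption)."""
    return TIME_TO_FIRST_TOKEN_S + output_tokens / TOKENS_PER_SECOND


if __name__ == "__main__":
    print(f"50,000 questions: ${cost_for_questions(50_000):.2f}")
    print(f"500-token answer: {estimated_response_time(500):.2f} s")
```

Under these assumptions, a single question costs about $0.000056, so the per-question price only becomes visible at very large run sizes.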