Grok 4.20 (Reasoning)

Release Date: 3/9/2026

Vals Index

Accuracy (Vals Index)

56.59% ± 2.00

Latency (Vals Index)

80.66s

Cost/Test (Vals Index)

$0.26

Context Window

2M

Max Output Tokens

2M

Input Modality

Hyperparameter settings
Default Provider : xAI

Temperature

0.7

Top P

0.95

Top K

Default

Max Output Tokens

2,000,000

Benchmarks

Accuracy

Rankings

0.0%

± 2.00
19/ 39

0.0%

± 1.57
18/ 27

0.0%

± 0.28
33/ 46

0.0%

± 0.95
17/ 96

0.0%

± 0.00
18/ 44

0.0%

± 2.12
41/ 50

0.0%

± 2.10
48/ 50

0.0%

± 0.99
58/ 68

0.0%

± 3.49
13/ 23

0.0%

± 3.43
27/ 48

0.0%

± 0.86
18/ 103

0.0%

± 2.06
23/ 25

0.0%

± 0.52
6/ 95

0.0%

± 1.59
7/ 98

0.0%

± 7.49
6/ 50

0.0%

± 1.03
14/ 102

0.0%

± 0.48
67/ 116

0.0%

± 0.21
16/ 95

0.0%

± 0.34
18/ 96

0.0%

± 0.89
11/ 65

0.0%

± 2.01
18/ 40

0.0%

± 5.23
20/ 51
Contact us
Or send us an email at contact@vals.ai
Proprietary Benchmarks (contact us to get access)
Academic Benchmarks

Read about our methodology.