Grok 4.20 (Reasoning)

Release Date: 3/9/2026

Vals Index

Accuracy (Vals Index)

39.11% ± 0.94

Latency (Vals Index)

195.12s

Cost/Test (Vals Index)

$0.49

Context Window

2M

Max Output Tokens

2M

Input Modality

Hyperparameter settings
Default Provider : xAI

Temperature

0.7

Top P

0.95

Top K

Default

Max Output Tokens

2,000,000

Benchmarks

Accuracy

Rankings

0.0%

± 0.94
15/ 16

0.0%

± 0.85
13/ 13

0.0%

± 0.95
22/ 104

0.0%

± 0.32
16/ 17

0.0%

± 2.12
47/ 56

0.0%

± 2.10
54/ 56

0.0%

± 0.99
62/ 73

0.0%

± 3.47
17/ 30

0.0%

± 3.42
31/ 54

0.0%

± 0.86
21/ 110

0.0%

± 2.06
36/ 43

0.0%

± 1.59
12/ 104

0.0%

± 7.49
8/ 53

0.0%

± 1.03
20/ 109

0.0%

± 0.48
71/ 107

0.0%

± 0.34
22/ 103

0.0%

± 0.89
14/ 70

0.0%

± 2.01
23/ 47

0.0%

± 5.23
28/ 61
Contact us
Or send us an email at contact@vals.ai
Proprietary Benchmarks (contact us to get access)
Academic Benchmarks

Read about our methodology.