Grok 4.20 (Reasoning)

Release Date: 3/9/2026

Vals Index

Accuracy (Vals Index)

39.50% ± 0.77

Latency (Vals Index)

223.99s

Cost/Test (Vals Index)

$0.54

Context Window

2M

Max Output Tokens

2M

Input Modality

Hyperparameter settings
Default Provider : xAI

Temperature

0.7

Top P

0.95

Top K

Default

Max Output Tokens

2,000,000

Benchmarks

Accuracy

Rankings

0.0%

± 0.77
26/ 31

0.0%

± 0.70
19/ 21

0.0%

± 0.95
29/ 116

0.0%

± 0.32
25/ 28

0.0%

± 2.12
55/ 68

0.0%

± 2.10
62/ 65

0.0%

± 0.99
69/ 80

0.0%

± 3.47
25/ 42

0.0%

± 3.42
37/ 61

0.0%

± 0.86
25/ 122

0.0%

± 2.06
56/ 66

0.0%

± 1.59
17/ 116

0.0%

± 7.49
10/ 55

0.0%

± 1.03
25/ 122

0.0%

± 0.48
79/ 119

0.0%

± 0.34
27/ 115

0.0%

± 0.89
17/ 76

0.0%

± 2.01
34/ 64

0.0%

± 0.99
27/ 35
Contact us
Or send us an email at contact@vals.ai

License type:

Proprietary (contact us to get access)
Industry Partner
Academic

Read our methodology.