Grok 4

Release Date: 7/9/2025

Vals Index

Accuracy (Vals Index)

54.65% ± 1.97

Latency (Vals Index)

399.36s

Cost/Test (Vals Index)

$0.72

Context Window

256k

Max Output Tokens

128k

Input Modality

Hyperparameter settings
Default Provider : xAI

Temperature

0.7

Top P

0.95

Top K

Default

Max Output Tokens

128,000

Benchmarks

Accuracy

Rankings

0.0%

± 1.97
24/ 39

0.0%

± 1.54
23/ 27

0.0%

± 0.21
6/ 46

0.0%

± 0.93
6/ 96

0.0%

± 2.85
14/ 44

0.0%

± 2.21
28/ 50

0.0%

± 2.08
21/ 50

0.0%

± 0.96
59/ 68

0.0%

± 2.98
45/ 48

0.0%

± 0.96
78/ 103

0.0%

± 1.76
26/ 95

0.0%

± 1.63
8/ 98

0.0%

± 6.76
7/ 50

0.0%

± 1.03
22/ 102

0.0%

± 0.47
23/ 116

0.0%

± 0.24
32/ 95

0.0%

± 0.35
25/ 96

0.0%

± 1.02
27/ 65

0.0%

± 2.21
34/ 40

0.0%

± 4.79
34/ 51
Contact us
Or send us an email at contact@vals.ai
Proprietary Benchmarks (contact us to get access)
Academic Benchmarks

Read about our methodology.