Grok 4

Release Date: 7/9/2025

Accuracy (Average)

60.37%

Latency (Average)

495.24s

Avg. Cost (In/Out)

$3 / $15 per 1M tokens

Context Window

256k

Max Output Tokens

128k

Input Modality

Hyperparameter settings

Default Provider: xAI

Temperature

0.7

Top P

0.95

Top K

Default

Max Output Tokens

128,000

Benchmarks

Accuracy          Rankings
0.0% ± 0.93       12/108
0.0% ± 2.21       35/60
0.0% ± 2.08       25/60
0.0% ± 0.96       66/76
0.0% ± 2.98       52/57
0.0% ± 0.96       88/114
0.0% ± 1.63       16/108
0.0% ± 6.76       10/54
0.0% ± 1.03       31/113
0.0% ± 0.47       30/111
0.0% ± 0.35       34/107
0.0% ± 1.02       34/73
0.0% ± 2.21       44/50
Proprietary Benchmarks (contact us to get access)
Academic Benchmarks
