Grok 4
Release Date: 7/9/2025
Benchmarked by
Latest and greatest flagship model offering unparalleled performance in natural language, math and reasoning. The perfect jack of all trades with native tool use and structured outputs support.
Accuracy (Vals Index)
55.62% ± 1.98
Latency (Vals Index)
368.74s
Cost/Test (Vals Index)
$0.75
Context Window
256k
Max Output Tokens
128k
Input Modality
Hyperparameter settings
Default Provider :
xAI
Some benchmarks may use different provider and parameters. Please refer to the benchmark page for more information.
Temperature
0.7
Top P
0.95
Top K
Default
Max Output Tokens
128,000
Benchmarks
Accuracy
Rankings
Academic Benchmarks
Proprietary Benchmarks (contact us to get access)