Accuracy (Vals Index)
64.11% ± 1.95
Latency (Vals Index)
444.60s
Cost/Test (Vals Index)
$0.78
Context Window
400k
Max Output Tokens
128k
Input Modality
Hyperparameter settings
Default Provider:
OpenAI
Some benchmarks may use a different provider or different parameters. Refer to the individual benchmark page for details.
Temperature
Default
Top P
Default
Top K
Default
Max Output Tokens
128,000
Reasoning Effort
xhigh
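As a rough illustration of how the settings above might translate into a request configuration, here is a minimal sketch. It assumes that "Default" means the parameter is left unset so the provider's own default applies; the card does not state this explicitly. The key names (`temperature`, `top_p`, `top_k`, `max_output_tokens`, `reasoning_effort`) and the helper `explicit_params` are hypothetical and are not taken from any provider's API.

```python
# Settings as listed on the card. The key names are illustrative only.
# Assumption: "Default" means the value is not sent, so the provider's
# own default is used.
CARD_SETTINGS = {
    "temperature": "Default",
    "top_p": "Default",
    "top_k": "Default",
    "max_output_tokens": 128_000,
    "reasoning_effort": "xhigh",
}


def explicit_params(settings):
    """Return only the settings that are explicitly overridden."""
    return {key: value for key, value in settings.items() if value != "Default"}


if __name__ == "__main__":
    print(explicit_params(CARD_SETTINGS))
```

Under that assumption, only the output-token limit and the reasoning effort would be set explicitly; the sampling parameters would be omitted from the request.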
Benchmarks
Academic Benchmarks
Proprietary Benchmarks (contact us to get access)