Accuracy (Vals Index)
63.87% ± 1.94
Latency (Vals Index)
435.62s
Cost/Test (Vals Index)
$0.32
Context Window
1M
Max Output Tokens
384k
Input Modality
Hyperparameter settings
Default Provider :
DeepSeek
Some benchmarks may use different provider and parameters. Please refer to the benchmark page for more information.
Temperature
1
Top P
0.95
Top K
Default
Max Output Tokens
384,000
Reasoning Effort
max
Show rankings only among open weight models
Benchmarks
Accuracy
Rankings
Contact us
Proprietary Benchmarks (contact us to get access)
Academic Benchmarks
Read about our methodology.