Accuracy (Vals Index): 57.88% ± 1.72
Latency (Vals Index): 719.39s
Cost/Test (Vals Index): $0.36
Context Window: 262k
Max Output Tokens: 66k
Input Modality
Hyperparameter settings
Default Provider: Alibaba
Some benchmarks may use a different provider or different parameters. Refer to the benchmark page for details.
Temperature: 0.6
Top P: 0.95
Top K: 20
Max Output Tokens: 65,536
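
As a rough illustration, the hyperparameter settings listed here could be collected into a request payload like the one sketched below. The key names (`temperature`, `top_p`, `top_k`, `max_output_tokens`), the `build_request` helper, and the `<model-id>` placeholder are assumptions made for the example; they are not taken from this page or from any specific provider's API, so check the provider's documentation for the actual field names.

```python
# Sampling settings as listed on this page.
# The key names are illustrative assumptions, not a specific provider's API.
SAMPLING_DEFAULTS = {
    "temperature": 0.6,
    "top_p": 0.95,
    "top_k": 20,
    "max_output_tokens": 65_536,
}


def build_request(prompt: str, model: str = "<model-id>") -> dict:
    """Merge the listed defaults into a hypothetical request payload."""
    # "<model-id>" is a placeholder; the page excerpt does not name the model.
    return {"model": model, "prompt": prompt, **SAMPLING_DEFAULTS}


if __name__ == "__main__":
    print(build_request("Hello"))
```

Keeping the defaults in one dictionary makes it easy to override a single value for a benchmark that uses different parameters.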
Benchmarks
Proprietary Benchmarks (contact us to get access)
Academic Benchmarks