Accuracy (Average)

60.67%

Latency (Average)

43.20s

Avg. Cost (In/Out)

0.9 / 0.9

Context Window

131k

Max Output Tokens

131k

Input Modality

Hyperparameter settings
Default Provider : DeepSeek

Temperature

1

Top P

Default

Top K

Default

Max Output Tokens

131,072

Show rankings only among open weight models

Benchmarks

Accuracy

Rankings

0.0%

± 0.98
57/ 96

0.0%

± 0.88
52/ 103

0.0%

± 1.71
59/ 95

0.0%

± 2.45
74/ 98

0.0%

± 0.90
46/ 50

0.0%

± 1.16
60/ 102

0.0%

± 0.42
68/ 116

0.0%

± 0.35
68/ 95

0.0%

± 0.40
57/ 96
Contact us
Or send us an email at contact@vals.ai
Proprietary Benchmarks (contact us to get access)
Academic Benchmarks

Read about our methodology.