Accuracy (Average): 51.26%
Latency (Average): 98.14s
Avg. Cost (In/Out): 0.22 / 0.88
Context Window: 1M
Max Output Tokens: 16k

Input Modality

Hyperparameter settings

Default Provider: Meta
Temperature: Default
Top P: Default
Top K: Default
Max Output Tokens: 16,384

Benchmarks

Accuracy        Rankings
0.0% ± 0.98     73/96
0.0% ± 1.99     31/50
0.0% ± 1.87     49/50
0.0% ± 0.97     41/68
0.0% ± 2.93     30/48
0.0% ± 0.92     72/103
0.0% ± 0.78     77/95
0.0% ± 2.35     60/98
0.0% ± 1.17     78/102
0.0% ± 0.42     66/116
0.0% ± 0.45     94/95
0.0% ± 0.40     59/96
0.0% ± 1.08     40/65
0.0% ± 1.58     51/51

Proprietary Benchmarks (contact us to get access)
Academic Benchmarks