Accuracy (Average): 61.21%
Latency (Average): 9.69s
Avg. Cost (In/Out): 3 / 15
Context Window: 200k
Max Output Tokens: 8k

Input Modality

Hyperparameter settings
Default Provider: Anthropic
Temperature: 1
Top P: Default
Top K: Default
Max Output Tokens: 8,192

Benchmarks

Accuracy       Rankings

0.0% ± 0.98    63/97
0.0% ± 1.80    24/69
0.0% ± 0.90    58/104
0.0% ± 0.94    89/96
0.0% ± 2.47    76/99
0.0% ± 1.14    78/103
0.0% ± 0.42    74/116
0.0% ± 0.67    64/95
0.0% ± 0.40    67/97
0.0% ± 1.11    46/66
Proprietary Benchmarks (contact us to get access)
Academic Benchmarks

Read about our methodology.