Accuracy (Average)

62.23%

Latency (Average)

229.58s

Avg. Cost (In/Out)

3 / 15

Context Window

200k

Max Output Tokens

64k

Input Modality

Hyperparameter settings
Default Provider : Anthropic

Temperature

1

Top P

Default

Top K

Default

Max Output Tokens

64,000

Benchmarks

Accuracy

Rankings

0.0%

± 0.96
26/ 97

0.0%

± 1.94
33/ 51

0.0%

± 2.21
44/ 51

0.0%

± 0.99
58/ 69

0.0%

± 3.21
35/ 49

0.0%

± 0.88
40/ 104

0.0%

± 1.43
51/ 96

0.0%

± 2.19
48/ 99

0.0%

± 1.14
35/ 50

0.0%

± 1.12
65/ 103

0.0%

± 0.45
37/ 116

0.0%

± 0.24
28/ 95

0.0%

± 0.36
34/ 97

0.0%

± 1.04
31/ 66
Contact us
Or send us an email at contact@vals.ai
Proprietary Benchmarks (contact us to get access)
Academic Benchmarks

Read about our methodology.