Accuracy (Average): 54.69%
Latency (Average): 21.59s
Avg. Cost (In/Out): 3 / 15
Context Window: 1M
Max Output Tokens: 64k

Input Modality

Hyperparameter settings
Default Provider: Anthropic
Temperature: 1
Top P: Default
Top K: Default
Max Output Tokens: 64,000
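
The hyperparameter settings listed here could be captured as a small parameter set when reproducing a run. The following is a minimal sketch, not the evaluation harness itself: the function name `build_sampling_params` and the dictionary keys are hypothetical, and it assumes that "Default" means the parameter is simply left unset so the provider's own default applies.

```python
from typing import Any, Dict, Optional


def build_sampling_params(
    temperature: float = 1.0,
    top_p: Optional[float] = None,   # None: leave at the provider default
    top_k: Optional[int] = None,     # None: leave at the provider default
    max_output_tokens: int = 64_000,
) -> Dict[str, Any]:
    """Collect sampling settings, omitting any left at 'Default'.

    Hypothetical helper for illustration; the key names are assumptions,
    not a documented provider API.
    """
    params: Dict[str, Any] = {
        "temperature": temperature,
        "max_output_tokens": max_output_tokens,
    }
    if top_p is not None:
        params["top_p"] = top_p
    if top_k is not None:
        params["top_k"] = top_k
    return params


# Settings as listed: temperature 1, Top P and Top K at default, 64,000 output tokens.
params = build_sampling_params()
```

Omitting unset keys, rather than sending placeholder values, keeps the request from overriding whatever default the provider applies.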

Benchmarks

Accuracy: 0.0% ± 0.96, Ranking: 33/96
Accuracy: 0.0% ± 1.99, Ranking: 22/50
Accuracy: 0.0% ± 1.93, Ranking: 7/50
Accuracy: 0.0% ± 3.14, Ranking: 33/48
Contact us by email at contact@vals.ai
Proprietary Benchmarks (contact us to get access)
Academic Benchmarks

Read about our methodology.