Accuracy (Average)

71.05%

Latency (Average)

7.60s

Avg. Cost (In/Out)

3 / 15

Context Window

200k

Max Output Tokens

8k

Input Modality

Hyperparameter settings
Default Provider : Anthropic

Temperature

1

Top P

Default

Top K

Default

Max Output Tokens

8,192

Benchmarks

Accuracy

Rankings

0.0%

± 1.77
9/ 80

0.0%

± 0.88
44/ 119

0.0%

± 2.35
79/ 113

0.0%

± 1.15
87/ 118

0.0%

± 0.48
63/ 116

0.0%

± 0.38
62/ 112

0.0%

± 1.08
51/ 76
Contact us
Or send us an email at contact@vals.ai
Proprietary Benchmarks (contact us to get access)
Academic Benchmarks

Read about our methodology.