Accuracy (Average)

60.77%

Latency (Average)

579.21s

Avg. Cost (In/Out)

15 / 75

Context Window

200k

Max Output Tokens

32k

Input Modality

Hyperparameter settings
Default Provider : Anthropic

Temperature

1

Top P

Default

Top K

Default

Max Output Tokens

32,000

Benchmarks

Accuracy

Rankings

0.0%

± 1.96
24/ 62

0.0%

± 2.02
49/ 61

0.0%

± 0.97
43/ 78

0.0%

± 3.02
45/ 59

0.0%

± 0.88
55/ 116

0.0%

± 2.30
70/ 110

0.0%

± 4.91
25/ 54

0.0%

± 1.11
74/ 115

0.0%

± 0.45
29/ 113

0.0%

± 0.33
19/ 109

0.0%

± 1.06
39/ 74
Contact us
Or send us an email at contact@vals.ai
Proprietary Benchmarks (contact us to get access)
Academic Benchmarks

Read about our methodology.