Accuracy (Average): 61.94%
Latency (Average): 493.46s
Avg. Cost (In/Out): 15 / 75
Context Window: 200k
Max Output Tokens: 32k

Input Modality
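As a rough illustration of how the average cost figures might be used, the sketch below estimates the cost of a single request. It assumes the "15 / 75" pair means USD per million input tokens and per million output tokens; the page does not state the unit, so treat that as an assumption. The function name `estimate_cost_usd` and the constants are hypothetical.

```python
# Assumption (not stated on the page): "15 / 75" is USD per million
# input tokens / USD per million output tokens.
INPUT_USD_PER_MTOK = 15.0
OUTPUT_USD_PER_MTOK = 75.0


def estimate_cost_usd(input_tokens: int, output_tokens: int) -> float:
    """Estimate the cost of one request under the assumed per-million rates."""
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    total = input_tokens * INPUT_USD_PER_MTOK + output_tokens * OUTPUT_USD_PER_MTOK
    return total / 1_000_000


if __name__ == "__main__":
    # Hypothetical request: 100,000 input tokens and 10,000 output tokens.
    print(f"{estimate_cost_usd(100_000, 10_000):.2f}")
```

Under these assumptions, output tokens cost five times as much as input tokens, so long generations dominate the total.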

Hyperparameter settings
Default Provider: Anthropic
Temperature: 1
Top P: Default
Top K: Default
Max Output Tokens: 32,000
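The settings above could be captured in code roughly as follows. This is a minimal sketch, not the evaluation harness actually used: the `EvalSettings` class and `to_request_params` helper are hypothetical names, and representing "Default" as `None` (meaning the parameter is left unset so the provider's own default applies) is an assumption.

```python
from dataclasses import asdict, dataclass
from typing import Optional


@dataclass(frozen=True)
class EvalSettings:
    """Hypothetical container for the hyperparameters listed on the page."""
    provider: str = "Anthropic"
    temperature: float = 1.0
    top_p: Optional[float] = None  # "Default": assumed to mean unset
    top_k: Optional[int] = None    # "Default": assumed to mean unset
    max_output_tokens: int = 32_000


def to_request_params(settings: EvalSettings) -> dict:
    """Drop unset fields so the provider's defaults are used for them."""
    return {k: v for k, v in asdict(settings).items() if v is not None}


if __name__ == "__main__":
    print(to_request_params(EvalSettings()))
```

Omitting unset fields, rather than sending explicit values, avoids guessing what the provider's defaults for Top P and Top K are.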

Benchmarks

Accuracy        Rankings
0.0% ± 1.96     20/51
0.0% ± 2.02     41/51
0.0% ± 0.97     35/69
0.0% ± 3.02     37/49
0.0% ± 0.88     47/104
0.0% ± 1.68     65/96
0.0% ± 2.30     60/99
0.0% ± 4.91     21/50
0.0% ± 1.11     62/103
0.0% ± 0.45     21/116
0.0% ± 0.24     30/95
0.0% ± 0.33     13/97
0.0% ± 1.06     32/66

Contact us by email at contact@vals.ai
Proprietary Benchmarks (contact us to get access)
Academic Benchmarks

Read about our methodology.