Claude Opus 4.1 (Thinking)

Release Date: 8/5/2025

Accuracy (Average)

65.50%

Latency (Average)

37.74s

Avg. Cost (In/Out)

15 / 75

Context Window

200k

Max Output Tokens

32k

Input Modality

Hyperparameter settings
Default Provider : Anthropic

Temperature

1

Top P

Default

Top K

Default

Max Output Tokens

32,000

Benchmarks

Accuracy

Rankings

0.0%

± 2.07
18/ 60

0.0%

± 1.97
40/ 60

0.0%

± 0.99
55/ 76

0.0%

± 3.19
47/ 57

0.0%

± 0.86
29/ 114

0.0%

± 2.16
51/ 108

0.0%

± 1.13
69/ 113

0.0%

± 0.33
10/ 107

0.0%

± 1.00
32/ 73
Contact us
Or send us an email at contact@vals.ai
Proprietary Benchmarks (contact us to get access)
Academic Benchmarks

Read about our methodology.