Claude Opus 4.5 (Thinking)

Release Date: 11/24/2025

Vals Index

Accuracy (Vals Index)

62.93% ± 1.98

Latency (Vals Index)

245.14s

Cost/Test (Vals Index)

$0.91

Context Window

200k

Max Output Tokens

64k

Input Modality

Hyperparameter settings
Default Provider : Anthropic

Temperature

1

Top P

Default

Top K

Default

Max Output Tokens

64,000

Compute Effort

high

Benchmarks

Accuracy

Rankings

0.0%

± 1.98
12/ 45

0.0%

± 1.55
9/ 30

0.0%

± 0.11
17/ 50

0.0%

± 0.94
16/ 100

0.0%

± 2.81
8/ 49

0.0%

± 2.01
10/ 54

0.0%

± 1.90
6/ 54

0.0%

± 0.92
14/ 72

0.0%

± 4.82
6/ 28

0.0%

± 3.40
3/ 51

0.0%

± 0.85
12/ 107

0.0%

± 0.00
22/ 40

0.0%

± 0.37
12/ 96

0.0%

± 2.48
18/ 102

0.0%

± 5.16
13/ 52

0.0%

± 1.04
23/ 108

0.0%

± 0.39
11/ 119

0.0%

± 0.18
10/ 95

0.0%

± 0.38
14/ 101

0.0%

± 0.90
16/ 69

0.0%

± 1.90
9/ 44

0.0%

± 5.31
13/ 56
Contact us
Or send us an email at contact@vals.ai
Proprietary Benchmarks (contact us to get access)
Academic Benchmarks

Read about our methodology.