Claude Opus 4.6 (Thinking)

Release Date: 2/5/2026

Vals Index

Accuracy (Vals Index)

66.00% ± 2.16

Latency (Vals Index)

605.30s

Cost/Test (Vals Index)

$2.50

Context Window

200k

Max Output Tokens

128k

Input Modality

Hyperparameter settings
Default Provider : Anthropic

Temperature

1

Top P

Default

Top K

Default

Max Output Tokens

128,000

Compute Effort

max

Benchmarks

Accuracy

Rankings

0.0%

± 2.16
4/ 35

0.0%

± 1.70
4/ 28

0.0%

± 0.93
5/ 104

0.0%

± 2.08
11/ 56

0.0%

± 1.94
4/ 56

0.0%

± 0.91
9/ 73

0.0%

± 5.00
4/ 30

0.0%

± 3.34
5/ 54

0.0%

± 0.83
3/ 110

0.0%

± 4.68
7/ 43

0.0%

± 1.19
9/ 104

0.0%

± 1.02
18/ 109

0.0%

± 0.37
8/ 107

0.0%

± 0.46
4/ 103

0.0%

± 0.88
12/ 70

0.0%

± 1.85
5/ 47

0.0%

± 5.25
8/ 61
Contact us
Or send us an email at contact@vals.ai
Proprietary Benchmarks (contact us to get access)
Academic Benchmarks

Read about our methodology.