Claude Opus 4.6 (Thinking)

Release Date: 2/5/2026

Vals Index

Accuracy (Vals Index)

65.88% ± 1.94

Latency (Vals Index)

334.54s

Cost/Test (Vals Index)

$0.89

Context Window

200k

Max Output Tokens

128k

Input Modality

Hyperparameter settings
Default Provider : Anthropic

Temperature

1

Top P

Default

Top K

Default

Max Output Tokens

128,000

Compute Effort

max

Benchmarks

Accuracy

Rankings

0.0%

± 1.94
2/ 39

0.0%

± 1.53
3/ 27

0.0%

± 0.37
17/ 46

0.0%

± 0.93
3/ 96

0.0%

± 2.78
3/ 44

0.0%

± 2.09
10/ 50

0.0%

± 1.94
3/ 50

0.0%

± 0.91
6/ 68

0.0%

± 5.03
2/ 23

0.0%

± 3.34
4/ 48

0.0%

± 0.83
3/ 103

0.0%

± 4.68
5/ 25

0.0%

± 0.64
8/ 95

0.0%

± 1.19
6/ 98

0.0%

± 1.02
13/ 102

0.0%

± 0.37
7/ 116

0.0%

± 0.19
12/ 95

0.0%

± 0.45
3/ 96

0.0%

± 0.88
9/ 65

0.0%

± 1.85
3/ 40

0.0%

± 5.25
6/ 51
Contact us
Or send us an email at contact@vals.ai
Proprietary Benchmarks (contact us to get access)
Academic Benchmarks

Read about our methodology.