Claude Sonnet 4.5 (Thinking)

Release Date: 9/29/2025

Vals Index

Accuracy (Vals Index)

59.88% ± 1.97

Latency (Vals Index)

276.83s

Cost/Test (Vals Index)

$0.66

Context Window

1M

Max Output Tokens

64k

Input Modality

Hyperparameter settings
Default Provider : Anthropic

Temperature

1

Top P

Default

Top K

Default

Max Output Tokens

64,000

Benchmarks

Accuracy

Rankings

0.0%

± 1.97
14/ 40

0.0%

± 1.55
13/ 28

0.0%

± 1.55
17/ 47

0.0%

± 0.96
23/ 97

0.0%

± 2.86
13/ 45

0.0%

± 2.00
17/ 51

0.0%

± 1.87
9/ 51

0.0%

± 0.96
25/ 69

0.0%

± 3.94
10/ 24

0.0%

± 3.21
30/ 49

0.0%

± 0.86
27/ 104

0.0%

± 0.69
30/ 96

0.0%

± 2.25
28/ 99

0.0%

± 5.92
14/ 50

0.0%

± 1.13
46/ 103

0.0%

± 0.45
15/ 116

0.0%

± 0.20
15/ 95

0.0%

± 0.39
9/ 97

0.0%

± 0.97
23/ 66

0.0%

± 2.05
23/ 41

0.0%

± 5.25
18/ 52
Contact us
Or send us an email at contact@vals.ai
Proprietary Benchmarks (contact us to get access)
Academic Benchmarks

Read about our methodology.