Claude Sonnet 4.5 (Thinking)

Release Date: 9/29/2025

Accuracy (Average): 58.53%

Latency (Average): 1216.56s

Avg. Cost (In/Out): 3 / 15

Context Window: 1M

Max Output Tokens: 64k

Input Modality

Hyperparameter settings
Default Provider: Anthropic

Temperature: 1

Top P: Default

Top K: Default

Max Output Tokens: 64,000
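
As a rough illustration, the settings listed above could be collected into request parameters along these lines. This is only a sketch: the helper name `build_request_params`, the key names (`model`, `temperature`, `max_tokens`), and the placeholder model ID are assumptions for illustration, not something this page specifies. No request is sent.

```python
def build_request_params(model_id: str) -> dict:
    """Hypothetical helper mirroring the hyperparameter settings on this page."""
    params = {
        "model": model_id,      # supplied by the caller; no API model ID is listed here
        "temperature": 1,       # Temperature: 1
        "max_tokens": 64_000,   # Max Output Tokens: 64,000 (key name is an assumption)
    }
    # Top P and Top K are listed as "Default", so they are deliberately left
    # unset and the provider's defaults would apply.
    return params


# Placeholder model ID, used only to show the shape of the result.
params = build_request_params("example-model-id")
print(params)
```

Leaving Top P and Top K out of the dictionary, rather than guessing numeric values, keeps the sketch consistent with the "Default" entries.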

Benchmarks

Accuracy         Rankings
0.0% ± 0.96      34/113
0.0% ± 2.00      22/65
0.0% ± 1.87      13/64
0.0% ± 0.95      34/80
0.0% ± 3.92      17/39
0.0% ± 3.21      40/61
0.0% ± 0.86      34/119
0.0% ± 3.73      29/59
0.0% ± 2.25      39/113
0.0% ± 5.92      18/55
0.0% ± 1.13      61/118
0.0% ± 0.45      22/116
0.0% ± 0.39      15/112
0.0% ± 0.97      32/76
0.0% ± 2.05      34/56
Proprietary Benchmarks (contact us to get access)
Academic Benchmarks
