Claude Sonnet 4 (Thinking)

Release Date: 5/22/2025

Accuracy (Average)

58.52%

Latency (Average)

242.94s

Avg. Cost (In/Out)

3 / 15

Context Window

200k

Max Output Tokens

64k

Input Modality

Hyperparameter settings
Default Provider : Anthropic

Temperature

1

Top P

Default

Top K

Default

Max Output Tokens

64,000

Benchmarks

Accuracy

Rankings

0.0%

± 0.96
36/ 109

0.0%

± 1.94
41/ 61

0.0%

± 2.21
52/ 61

0.0%

± 0.98
65/ 78

0.0%

± 3.21
43/ 59

0.0%

± 0.88
47/ 115

0.0%

± 2.19
56/ 109

0.0%

± 1.14
39/ 54

0.0%

± 1.13
76/ 114

0.0%

± 0.46
45/ 112

0.0%

± 0.36
43/ 108

0.0%

± 1.04
38/ 74
Contact us
Or send us an email at contact@vals.ai
Proprietary Benchmarks (contact us to get access)
Academic Benchmarks

Read about our methodology.