Claude 3.7 Sonnet (Thinking)

Release Date: 2/24/2025

Accuracy (Average): 72.17%
Latency (Average): 54.80s
Avg. Cost (In/Out): 3 / 15
Context Window: 200k
Max Output Tokens: 8k

Input Modality

Hyperparameter settings
Default Provider: Anthropic
Temperature: 1
Top P: Default
Top K: Default
Max Output Tokens: 8,192
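
The hyperparameter settings listed above can be written as a small parameter mapping. This is a minimal sketch, not the evaluation harness itself: the key names `temperature`, `top_p`, `top_k`, and `max_tokens` are assumed here for illustration, and `build_sampling_params` is a hypothetical helper. Values shown as "Default" are left unset so that the provider's own default would apply.

```python
# Settings as listed on this page. None marks a value shown as "Default",
# meaning no explicit override is sent (assumption for illustration).
LISTED_SETTINGS = {
    "temperature": 1,
    "top_p": None,
    "top_k": None,
    "max_tokens": 8192,
}


def build_sampling_params(settings):
    """Hypothetical helper: keep only the explicitly set values."""
    return {key: value for key, value in settings.items() if value is not None}


params = build_sampling_params(LISTED_SETTINGS)
print(params)
```

Dropping the unset keys, rather than guessing numeric defaults, keeps the mapping faithful to what the page actually states.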

Benchmarks

Accuracy          Rankings
0.0% ± 0.96       47 / 108
0.0% ± 1.80       22 / 76
0.0% ± 0.86       25 / 114
0.0% ± 2.17       54 / 108
0.0% ± 1.14       77 / 113
0.0% ± 0.53       39 / 111
0.0% ± 0.37       50 / 107
0.0% ± 1.04       36 / 73