Claude Sonnet 4.5 (Thinking)
Release Date: 9/29/2025
Benchmarked by
Anthropic's latest flagship model
Accuracy (Vals Index)
60.43% ± 1.97
Latency (Vals Index)
302.46s
Cost/Test (Vals Index)
$0.76
Context Window
1M
Max Output Tokens
64k
Input Modality
Hyperparameter settings
Default Provider :
Anthropic
Some benchmarks may use different provider and parameters. Please refer to the benchmark page for more information.
Temperature
1
Top P
Default
Top K
Default
Max Output Tokens
64,000
Benchmarks
Accuracy
Rankings
Academic Benchmarks
Proprietary Benchmarks (contact us to get access)