GPT 5.1

Release Date: 11/13/2025

Accuracy (Average)

64.89%

Latency (Average)

377.46s

Avg. Cost (In/Out)

1.25 / 10

Context Window

400k

Max Output Tokens

128k

Input Modality

Hyperparameter settings
Default Provider : OpenAI

Temperature

Default

Top P

Default

Top K

Default

Max Output Tokens

128,000

Reasoning Effort

high

Benchmarks

Accuracy

Rankings

0.0%

± 0.95
22/ 105

0.0%

± 2.15
5/ 57

0.0%

± 1.94
1/ 57

0.0%

± 0.93
38/ 74

0.0%

± 3.16
23/ 55

0.0%

± 0.85
11/ 111

0.0%

± 4.25
18/ 44

0.0%

± 1.71
18/ 105

0.0%

± 7.34
12/ 53

0.0%

± 0.98
8/ 110

0.0%

± 0.40
7/ 108

0.0%

± 0.34
22/ 104

0.0%

± 0.90
16/ 71

0.0%

± 2.06
32/ 48

0.0%

± 5.30
22/ 62
Contact us
Or send us an email at contact@vals.ai
Proprietary Benchmarks (contact us to get access)
Academic Benchmarks

Read about our methodology.