GPT 5.1

Release Date: 11/13/2025

Accuracy (Average)

64.89%

Latency (Average)

377.46s

Avg. Cost (In/Out)

1.25 / 10

Context Window

400k

Max Output Tokens

128k

Input Modality

Hyperparameter settings
Default Provider : OpenAI

Temperature

Default

Top P

Default

Top K

Default

Max Output Tokens

128,000

Reasoning Effort

high

Benchmarks

Accuracy

Rankings

0.0%

± 0.95
23/ 108

0.0%

± 2.15
6/ 60

0.0%

± 1.94
1/ 60

0.0%

± 0.93
39/ 76

0.0%

± 3.16
24/ 57

0.0%

± 0.85
13/ 114

0.0%

± 4.25
19/ 47

0.0%

± 1.71
20/ 108

0.0%

± 7.34
13/ 54

0.0%

± 0.98
10/ 113

0.0%

± 0.40
7/ 111

0.0%

± 0.34
24/ 107

0.0%

± 0.90
17/ 73

0.0%

± 2.06
33/ 50
Contact us
Or send us an email at contact@vals.ai
Proprietary Benchmarks (contact us to get access)
Academic Benchmarks

Read about our methodology.