GPT 5.1

Release Date: 11/13/2025

Vals Index

Accuracy (Vals Index)

60.38% ± 2.00

Latency (Vals Index)

411.84s

Cost/Test (Vals Index)

$0.36

Context Window

400k

Max Output Tokens

128k

Input Modality

Hyperparameter settings
Default Provider : OpenAI

Temperature

Default

Top P

Default

Top K

Default

Max Output Tokens

128,000

Reasoning Effort

high

Benchmarks

Accuracy

Rankings

0.0%

± 2.00
12/ 39

0.0%

± 1.57
10/ 27

0.0%

± 0.75
1/ 46

0.0%

± 0.95
16/ 96

0.0%

± 2.80
9/ 44

0.0%

± 2.15
3/ 50

0.0%

± 1.94
1/ 50

0.0%

± 0.93
32/ 68

0.0%

± 3.16
18/ 48

0.0%

± 0.85
9/ 103

0.0%

± 0.62
14/ 95

0.0%

± 1.71
12/ 98

0.0%

± 7.34
10/ 50

0.0%

± 0.98
5/ 102

0.0%

± 0.40
6/ 116

0.0%

± 0.17
2/ 95

0.0%

± 0.34
17/ 96

0.0%

± 0.90
12/ 65

0.0%

± 2.06
24/ 40

0.0%

± 5.30
16/ 51
Contact us
Or send us an email at contact@vals.ai
Proprietary Benchmarks (contact us to get access)
Academic Benchmarks

Read about our methodology.