o3

Release Date: 4/16/2025

Accuracy (Average)

74.22%

Latency (Average)

49.45s

Avg. Cost (In/Out)

2 / 8

Context Window

200k

Max Output Tokens

100k

Input Modality

Hyperparameter settings
Default Provider : OpenAI

Temperature

Default

Top P

Default

Top K

Default

Max Output Tokens

100,000

Reasoning Effort

high

Benchmarks

Accuracy

Rankings

0.0%

± 0.96
42/ 96

0.0%

± 2.16
13/ 50

0.0%

± 1.87
28/ 50

0.0%

± 0.93
19/ 68

0.0%

± 3.31
23/ 48

0.0%

± 0.85
13/ 103

0.0%

± 1.39
36/ 95

0.0%

± 1.86
21/ 98

0.0%

± 1.03
17/ 102

0.0%

± 0.42
20/ 116

0.0%

± 0.18
6/ 95

0.0%

± 0.34
23/ 96

0.0%

± 0.95
19/ 65
Contact us
Or send us an email at contact@vals.ai
Proprietary Benchmarks (contact us to get access)
Academic Benchmarks

Read about our methodology.