DeepSeek V4

Release date

Models

Release Date: 4/23/2026

Accuracy (Vals Index)

55.62% ± 1.69

Latency (Vals Index)

1326.91s

Cost/Test (Vals Index)

$0.83

Context Window

1M

Max Output Tokens

384k

Input Modality

Hyperparameter settings

Default Provider : DeepSeek

Temperature

1

Top P

Default

Top K

Default

Max Output Tokens

384,000

Reasoning Effort

max

Show rankings only among open weight models

Benchmarks

Accuracy

Rankings

0.0%

± 1.69

18/ 39

0.0%

± 4.04

17/ 32

0.0%

± 0.96

47/ 125

0.0%

± 5.84

11/ 19

0.0%

± 3.05

19/ 27

Finance Agent (v2)

0.0%

± 0.65

23/ 39

Legal Research Bench

0.0%

± 2.93

19/ 26

0.0%

± 2.12

39/ 75

0.0%

± 2.00

51/ 74

0.0%

± 3.00

38/ 52

Public Benefits Bench v1.1

0.0%

± 1.26

7/ 22

0.0%

± 0.88

59/ 131

Vibe Code Bench v1.1

0.0%

± 4.77

24/ 75

Harvey's Legal Agent Benchmark

0.0%

± 1.43

10/ 26

0.0%

± 1.65

22/ 125

0.0%

± 7.14

12/ 59

0.0%

± 0.95

7/ 130

0.0%

± 0.47

70/ 128

0.0%

± 0.34

25/ 124

0.0%

± 0.00

24/ 33

0.0%

± 4.60

13/ 19

0.0%

± 1.87

23/ 74

Terminal-Bench 2.1

0.0%

± 1.50

34/ 44

Contact us

License type:

Proprietary (contact us to get access)

Industry Partner

Academic

Read our methodology.