Independent Evaluation, Unbiased Benchmarks

Testing AI on Real-World Tasks

We benchmark the world's leading AI models on rigorous, domain-specific tasks in finance, law, software, healthcare, and more. We run all of our own evaluations and create many of our benchmarks in-house.

Vals AI Updates

Fresh updates from our testing queue

model
06/13/2026

Kimi K2.7 Code evaluated across our coding benchmarks

Benchmarks

Accuracy          Ranking
47.21% ± 5.19     18/62
82.05% ± 1.07     39/121
0.00% ± 0.00      13/23
78.20% ± 1.85     9/59
67.04% ± 0.38     7/30
Contact us by email at contact@vals.ai.
Proprietary Benchmarks (contact us to get access)
Academic Benchmarks

Read about our methodology.

Industry Leaderboard

Independent benchmarks for industry-specific AI performance.

Industry
Benchmark