Vals AI News
Follow us on
Follow us on
Harvey's Legal Agent Benchmark Released
Vals AICode Migration Released
Vals AIKimi K2.7 Code evaluated across our coding benchmarks
Vals AIAnthropic's Claude Fable 5 evaluated across our benchmark suite
Vals AIPublic Benefits Bench Released
Vals AINVIDIA's Nemotron 3 Ultra evaluated across our benchmark suite
Vals AIAlibaba's Qwen 3.7 Plus evaluated on the Vals Index
Vals AIMiniMax's MiniMax M3 evaluated across our benchmark suite
Vals AIAnthropic's Claude Opus 4.8 evaluated across our benchmark suite
Vals AIResults for Terminal-Bench 2.1 Released!
Vals AI