Behind the Scenes of Vibe Code Bench
Last week, we released Vibe Code Bench. This is our most ambitious benchmark to-date. I hope it will provide a useful signal for researchers and vibe-coders alike.
Rayan Krishnan
Minimax M2.5 - another step forward for open-weight coding models
Vals AIFull results for GLM 5
Vals AIGLM 5 - the new frontier open-weight coding model
Vals AIFAB v1.1 Released!
Vals AIClaude Opus 4.6 is the new SOTA
Vals AIQwen 3 Max Thinking Evaluated on Vals Index!
Vals AIProofBench Released: Evaluating Formal Mathematical Reasoning
Vals AIKimi K2.5 Evaluated on (almost) all benchmarks!
Vals AIKimi K2.5 sets a new open-source standard
Vals AIResults for Terminal-Bench 2.0 Released!
Vals AI

