BlogBenchmark
11/26/2025Behind the Scenes of Vibe Code Bench
Last week, we released Vibe Code Bench. This is our most ambitious benchmark to-date. I hope it will provide a useful signal for researchers and vibe-coders alike.
Rayan Krishnan
Model benchmarks are seriously lacking. With Vals AI, we report how language models perform on the industry-specific tasks where they will be used.
By subscribing, I agree to Vals' Privacy Policy.