
Ben Ank
@benankdev
developer relations @GroqInc ⚡️ bringing imAgInation to life with tech 📀
ID: 1289432475853180929
https://groq.com 01-08-2020 05:27:44
417 Tweet
327 Followers
258 Following















SemiAnalysis It’s unfortunate that we can’t trust model makers’ eval results. This is an issue we’re trying to solve with OpenBench - an easy standard way to run evals easily transparently. We just released v0.2.0 and we’ll be adding SWE-Bench-Verified shortly 🫡 github.com/groq/openbench


