
Radek Bartyzal
@radekbartyzal
Recommendation Team Lead at GLAMI. Building production ML systems for millions of users.
ID: 1908291367
https://github.com/BartyzalRadek 26-09-2013 15:26:52
348 Tweet
179 Followers
170 Following








It is critical for scientific integrity that we trust our measure of progress. The lmarena.ai has become the go-to evaluation for AI progress. Our release today demonstrates the difficulty in maintaining fair evaluations on lmarena.ai, despite best intentions.






