
Jakub Macina
@dmacjam
AI/ML Scientist, mountain biker
ID: 1557692280
http://macina.sk 30-06-2013 10:06:20
101 Tweet
204 Takipรงi
497 Takip Edilen

๐ ๐๐จ๐ฐ ๐ฐ๐๐ฅ๐ฅ ๐๐๐ง ๐๐๐๐ฌ ๐ญ๐๐๐๐ก? Evaluating LLMs for education is key to making real progress, yet we lack a reliable and simple benchmark. Introducing ๐๐๐ญ๐ก๐๐ฎ๐ญ๐จ๐ซ๐๐๐ง๐๐กโan open-source benchmark designed to assess holistic tutoring capabilities in AI.