Blair Yang (@blairyang12) 's Twitter Profile
Blair Yang

@blairyang12

ID: 1063873082169864192

calendar_today17-11-2018 19:14:43

6 Tweet

34 Takipçi

160 Takip Edilen

Michael Zhang (@michaelrzhang) 's Twitter Profile Photo

📝 How do you choose which language model to use? Quantitative benchmarks can be uninformative and fall prey to Goodhart's Law, and even Chatbot Arena performance can be optimized for. In our new preprint, we propose generating qualitative report cards... 🧵

📝 How do you choose which language model to use? Quantitative benchmarks can be uninformative and fall prey to Goodhart's Law, and even Chatbot Arena performance can be optimized for.

In our new preprint, we propose generating qualitative report cards... 🧵
Michael Zhang (@michaelrzhang) 's Twitter Profile Photo

You can now listen to an digestible AI podcast about a research paper on AI grading AIs The new paper -> podcast quality is wild