lmsys.org (@lmsysorg)'s Twitter Profile
lmsys.org

@lmsysorg

Large Model Systems Organization. We created Vicuna and Chatbot Arena! Compare 50+ LLMs (GPT-4/Claude/Gemini/Llamas) side-by-side at lmarena.ai

ID: 1641378826537295874

Link: http://lmsys.org
Joined: 30-03-2023 09:56:38

656 Tweets

54.54K Followers

178 Following

Does style matter over substance in Arena? Can models "game" human preference through lengthy and well-formatted responses? Today, we're launching style control in our regression model for Chatbot Arena — our first step in separating the impact of style from substance in
