Connor Chen (@connorzchen)'s Twitter Profile
Connor Chen

@connorzchen

CS @ Berkeley
Research @lmarena_ai

ID: 1763446697844514816

Joined: 01-03-2024 06:10:44

4 Tweets

8 Followers

54 Following

lmarena.ai (formerly lmsys.org) (@lmarena_ai)

Introducing Prompt-to-leaderboard (P2L): a real-time LLM leaderboard tailored exactly to your use case! P2L trains an LLM to generate "prompt-specific" leaderboards, so you can input a prompt and get a leaderboard specifically for that prompt. The model is trained on the 2M

Anastasios Nikolas Angelopoulos (@ml_angelopoulos)

Prompt-to-Leaderboard is one of my favorite projects ever. "Which LLM is best for me and my use-case?" We train an LLM to take in prompts and output a vector of BT regression coefficients: one per model. By converting evaluation into learning, we benefit from scaling laws in
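
As a rough illustration of what "a vector of BT regression coefficients: one per model" can be used for, the sketch below turns such a vector into a ranked leaderboard and a pairwise win probability using the standard Bradley-Terry formula. The model names and coefficient values are hypothetical placeholders, and this is not the P2L implementation; it only shows how a per-prompt coefficient vector might be read once it has been produced.

```python
import math


def bt_win_probability(coef_a: float, coef_b: float) -> float:
    """Bradley-Terry probability that model A is preferred over model B.

    Under the Bradley-Terry model, the preference probability is the
    logistic function of the difference between the two coefficients.
    """
    return 1.0 / (1.0 + math.exp(-(coef_a - coef_b)))


def leaderboard(coefficients: dict[str, float]) -> list[tuple[str, float]]:
    """Rank models from highest to lowest coefficient."""
    return sorted(coefficients.items(), key=lambda item: item[1], reverse=True)


# Hypothetical coefficients for one prompt (assumed values, not real data).
example_coefficients = {"model_a": 1.2, "model_b": 0.4, "model_c": -0.3}

ranking = leaderboard(example_coefficients)
p_a_over_b = bt_win_probability(
    example_coefficients["model_a"], example_coefficients["model_b"]
)

for rank, (name, coef) in enumerate(ranking, start=1):
    print(f"{rank}. {name}: {coef:+.2f}")
print(f"P(model_a preferred over model_b) = {p_a_over_b:.3f}")
```

Only differences between coefficients matter here: adding the same constant to every entry leaves both the ranking and the win probabilities unchanged.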

lmarena.ai (formerly lmsys.org) (@lmarena_ai)

📢We’re excited to share that we’ve raised $100M in seed funding to support LMArena and continue our research on reliable AI. Led by a16z and UC Investments (University of California), we're proud to have the support of those that believe in both the science and the mission. We’re

Guillermo Rauch (@rauchg)

Just heard from the LMArena team that since launching on Vercel, usage is up 6.5× to 1.2 million human votes per month 😳 Arenas are such a critical resource to help devs understand real-world model performance. h/t Anastasios Nikolas Angelopoulos