Deedy (@deedydas)'s Twitter Profile
Deedy

@deedydas

VC at @MenloVentures. Formerly founding team @glean, @Google Search. @Cornell CS. Tweets about tech, immigration, India, fitness and search.

ID: 361044311

Link: http://debarghyadas.com
Joined: 24-08-2011 04:46:48

14,14K Tweets

166,166K Followers

4,4K Following

LLMs are far worse at competitive programming than we thought. Every one scored 0% on Hard problems. LiveCodeBench-Pro is a new benchmark with 584 always updating problems from IOI, ICPC and Codeforces. What's most interesting is the categories they perform really poorly on: