
Adithya Bhaskar
@adithyanlp
Second Year CS Ph.D. student at Princeton University (@princeton_nlp), previously CS undergrad at IIT Bombay
ID: 1669231860130660352
http://adithyabh.github.io 15-06-2023 06:34:30
39 Tweet
226 Followers
245 Following

๐คจ Are Multimodal Large Language Models really as ๐ ๐จ๐จ๐ at ๐๐ก๐๐ซ๐ญ ๐ฎ๐ง๐๐๐ซ๐ฌ๐ญ๐๐ง๐๐ข๐ง๐ as existing benchmarks such as ChartQA suggest? ๐ซ Our โ๐๐๐ฃ๐๐๐ง benchmark suggests NO! ๐ฅHumans achieve โจ๐๐+% correctness. ๐ฅSonnet 3.5 outperforms GPT-4o by 10+ points,


How can we understand neural chatbots in terms of interpretable, symbolic mechanisms? To explore this question, we constructed a Transformer that implements the classic ELIZA chatbot algorithm (with Abhishek Panigrahi and Danqi Chen). Paper: arxiv.org/abs/2407.10949 (1/6)








