
tusharkhot
@tusharkhot
Senior Research Scientist, Allen Institute for AI
ID: 13091972
https://allenai.org/team/tushark/
05-02-2008 13:05:09
105 Tweets
309 Followers
187 Following

I'll be presenting my #NAACL2024 work: ✨ADaPT✨ in person 🇲🇽 tomorrow (June 19) at 11 AM in poster session 7! ADaPT enables LLMs to "adapt" to task complexity & execution failures by decomposing recursively. w/ Alexander Koller, M Hartmann, P Clark, Ashish Sabharwal, Mohit Bansal, tusharkhot

Can LLMs help accelerate the discovery of data-driven scientific hypotheses? 🧬📊 We benchmark this in DiscoveryBench: 264 discovery tasks from 6 scientific domains, from humanities to biology: arxiv.org/pdf/2407.01725… Ai2, Aristo Team at AI2, Harshit Surana, UMass Amherst

AppWorld won the (one of the) best resource paper award(s) at #ACL2024. Outstanding resource and great work by Harsh Trivedi, Niranjan at Stony Brook, tusharkhot, Ashish Sabharwal, Aristo Team at AI2, and collaborators 🧵👇

🏆 AppWorld won a #ACL2024NLP Best Resource Paper Award. 🥳 Congrats team! I'm so happy for Harsh Trivedi. The time & care he put in is inspiring. #proudadvisor 🚨He is on the job market.🚨 Hire him! 🌐Check out appworld.dev Stony Brook University Dept. of Computer Science @AI_SBU #NLProc Ai2

Can language models perform end-to-end scientific discovery? In our NeurIPS Spotlight paper, we show: very rarely. Our best model found <20% of discoveries; our best PhDs found nearly all. Paper: arxiv.org/pdf/2406.06769 Code/Web: allenai.github.io/discoveryworld Ai2, Microsoft Research

For this week’s NLP Seminar, we are thrilled to host Harsh Trivedi to talk about AppWorld: Reliable Evaluation of Interactive Agents in a World of Apps and People! When: 10/10 Thurs 11am PT Non-Stanford affiliates registration form: forms.gle/UjWyX6dn7mQafj… (closed at 9am PT on