Siddharth Dalmia
@siddalmia05
Research Scientist @GoogleDeepmind | #SpeechProc and #NLProc | PhD from @LTIatCMU @SCSatCMU | Ex-intern @GoogleAI, @AWSCloud, @FacebookAI
ID: 775715067325284352
https://www.cs.cmu.edu/~sdalmia 13-09-2016 15:17:39
243 Tweets
1.1K Followers
447 Following
poster session + Qun Liu's talk! William Wang Amy Zhang + Language Control Diffusion Rachit Bansal Siddharth Dalmia Nitish Gupta Sriram Ganapathy Prateek Jain Partha Talukdar LLM Augmented LLMs Sangwoo Mo Sukmin Yun Jung-Woo Ha Jinwoo Shin Hierarchical Context Merging
I am pleased to share that I'll be joining Harvard University as a PhD student this Fall. Looking forward to working with David Alvarez Melis, Martin Wattenberg, Fernanda Viégas, et al. at SEAS! I'll be supported by a Kempner Institute at Harvard University fellowship, and am keen to further our understanding & usability of large ML models!
Exciting News from Chatbot Arena! Google DeepMind's new Gemini 1.5 Pro (Experimental 0801) has been tested in Arena for the past week, gathering over 12K community votes. For the first time, Google Gemini has claimed the #1 spot, surpassing GPT-4o/Claude-3.5 with an impressive
Sail with us at #WiNLP2024! Join the panel "Sailing the NLP Seas: Navigating Research in the Age of LLMs" on Nov 15, 11:00 AM - 12:00 PM, where Abhilasha Ravichander, @Sunayana, Isabelle Augenstein, Lu Wang, and Mrinmaya Sachan will dive into the evolving tides of NLP in the LLM era. #EMNLP2024
We are launching HALoGEN, a way to systematically study *when* and *why* LLMs still hallucinate. New work w/ Shrusti Ghela* David Wadden Yejin Choi [1/n]
Want to know what training data has been memorized by models like GPT-4? We propose information-guided probes, a method to uncover memorization evidence in *completely black-box* models, without requiring access to model weights, training data, or token probabilities. 1/5
Started a new role at WaveForms AI, founded by Alexis Conneau and Coralie Lemaitre (waveforms.ai). I am excited to be working with a fantastic team of AI dreamers building the future of audio LLMs. Ready to give form to the coming wave of audio intelligence.
Life update: I'm excited to share that I'll be starting as faculty at the Max Planck Institute for Software Systems this Fall! I'll be recruiting PhD students in the upcoming cycle, as well as research interns throughout the year: lasharavichander.github.io/contact.html
Super thrilled that HALoGEN, our study of LLM hallucinations and their potential origins in training data, received an Outstanding Paper Award at ACL! Joint work w/ Shrusti Ghela*, David Wadden, and Yejin Choi