Siddharth Dalmia (@siddalmia05) Twitter Tweets • TwiCopy

ACL SIGEL

2 years ago

✨ComputEL-7 will be co-located with EACL 2024!✨ We are excited to see you in Malta! 📢📢The submission deadline is December 15, 2023. More info about the workshop and important dates: computel-workshop.org/computel-7/

thumb_up_off_alt16

chat_bubble_outline0

repeat9

shareShare

Shinji Watanabe

@shinjiw_at_cmu

2 years ago

Happy New Year! Last year, our group published over 70 papers (ICASSPx29, Interspeechx17, ASRUx12, etc.)! I'm very happy to work with my great colleagues. Thanks, everyone! (Note that I do not include some technical reports, arXiv, and workshop/challenge papers.)

thumb_up_off_alt128

chat_bubble_outline3

repeat11

shareShare

elvis

@omarsar0

2 years ago

LLM Augmented LLMs Yeah, you heard that right! This work explores composing existing foundation models with specific models to expand capabilities. Introduces cross-attention between models to compose representations that enable new capabilities. As an example, a PaLM2-S

thumb_up_off_alt441

chat_bubble_outline7

repeat114

shareShare

Samridhi Choudhary

@samridhishree

2 years ago

Super pumped to show what my incredible team in Amazon has been working on with BMW at #CES2024 ! We leverage our in-house AlexaLLM to bring you a much more conversational, informative and a fun BMW Car Expert. Experience it for yourself if you are at CES this week!

thumb_up_off_alt14

chat_bubble_outline2

repeat2

shareShare

Shikhar

@shikharssu

2 years ago

Is that speech English, Spanish, or Mandarin? 🕵️‍♂️ 🔊Super excited to share several works that improve language identification capabilities via both pre-training and fine-tuning. MuSeLI: Multimodal modeling for spoken language identification [ICASSP 2024] 📎

thumb_up_off_alt35

chat_bubble_outline1

repeat12

shareShare

Sabera Talukder

@saberatalukder

2 years ago

👀👀 poster session + Qun Liu's talk! William Wang Amy Zhang + Language Control Diffusion Rachit Bansal Siddharth Dalmia Nitish Gupta Sriram Ganapathy Prateek Jain Partha Talukdar LLM Augmented LLMs Sangwoo Mo Sukmin Yun Jung-Woo Ha Jinwoo Shin Hierarchical Context Merging

👀👀 poster session + <a href="/LiuQunMTtoDeath/">Qun Liu</a>'s talk!

<a href="/WilliamWangNLP/">William Wang</a> <a href="/yayitsamyzhang/">Amy Zhang</a> +
Language Control Diffusion

<a href="/rach_it_/">Rachit Bansal</a> <a href="/siddalmia05/">Siddharth Dalmia</a> <a href="/nitish_gup/">Nitish Gupta</a> <a href="/tweet4sri/">Sriram Ganapathy</a> <a href="/jainprateek_/">Prateek Jain</a> <a href="/partha_p_t/">Partha Talukdar</a>
LLM Augmented LLMs

<a href="/sangwoomo/">Sangwoo Mo</a> <a href="/seokmin_youn/">Sukmin Yun</a> <a href="/JungWooHa2/">Jung-Woo Ha</a> <a href="/JwShin_CHEM/">Jinwoo Shin</a>
Hierarchical Context Merging

thumb_up_off_alt16

chat_bubble_outline1

repeat4

shareShare

Rachit Bansal

@rach_it_

a year ago

I am pleased to share that I'll be joining Harvard University as a PhD student this Fall. Looking forward to work with David Alvarez Melis, Martin Wattenberg, Fernanda Viégas, et al. at SEAS! I'll be supported by a Kempner Institute at Harvard University fellowship, and am keen to further our understanding & usability of large ML models!

thumb_up_off_alt840

chat_bubble_outline41

repeat17

shareShare

lmarena.ai (formerly lmsys.org)

@lmarena_ai

a year ago

Exciting News from Chatbot Arena! Google DeepMind's new Gemini 1.5 Pro (Experimental 0801) has been tested in Arena for the past week, gathering over 12K community votes. For the first time, Google Gemini has claimed the #1 spot, surpassing GPT-4o/Claude-3.5 with an impressive

Exciting News from Chatbot Arena!

<a href="/GoogleDeepMind/">Google DeepMind</a>'s new Gemini 1.5 Pro (Experimental 0801) has been tested in Arena for the past week, gathering over 12K community votes.

For the first time, Google Gemini has claimed the #1 spot, surpassing GPT-4o/Claude-3.5 with an impressive

thumb_up_off_alt1,1K

chat_bubble_outline83

repeat410

shareShare

Kiran Vodrahalli ([email protected])

@kiranvodrahalli

a year ago

Happy to share Michelangelo (arxiv.org/abs/2409.12640), a long-context reasoning benchmark which measures performance beyond needle tasks up to arbitrary context lengths and remains challenging for frontier models. Stay tuned for more Michelangelo evals to come!

thumb_up_off_alt358

chat_bubble_outline3

repeat56

shareShare

WiNLP

@winlpworkshop

a year ago

🌊 Sail with us at #WiNLP2024! 🌊 Join panel "Sailing the NLP Seas: Navigating Research in the Age of LLMs" on Nov 15, 11:00 AM - 12:00 PM with Abhilasha Ravichander, @Sunayana , Isabelle Augenstein, Lu Wang, Mrinmaya Sachan will dive into the evolving tides of NLP in the LLM era. ⚓️ #EMNLP2024

🌊 Sail with us at #WiNLP2024! 🌊

Join panel "Sailing the NLP Seas: Navigating Research in the Age of LLMs" on Nov 15, 11:00 AM - 12:00 PM with <a href="/lasha_nlp/">Abhilasha Ravichander</a>, @Sunayana , <a href="/IAugenstein/">Isabelle Augenstein</a>, <a href="/LuWang__/">Lu Wang</a>, <a href="/mrinmayasachan/">Mrinmaya Sachan</a> will dive into the evolving tides of NLP in the LLM era. ⚓️
#EMNLP2024

thumb_up_off_alt25

chat_bubble_outline0

repeat10

shareShare

Abhilasha Ravichander

@lasha_nlp

a year ago

✨I’m on the faculty job market for 2024-2025! ✨ My research focuses on advancing Responsible AI—enhancing factuality, robustness, and transparency in AI systems. I’m at #EMNLP2024 this week🌴 and would love to chat about research and hear any advice!

thumb_up_off_alt224

chat_bubble_outline1

repeat46

shareShare

Abhilasha Ravichander

@lasha_nlp

9 months ago

We are launching HALoGEN💡, a way to systematically study *when* and *why* LLMs still hallucinate. New work w/ Shrusti Ghela* David Wadden Yejin Choi 💫 🧵 [1/n]

We are launching HALoGEN💡, a way to systematically study *when* and *why* LLMs still hallucinate.

New work w/ <a href="/shrusti_ghela/">Shrusti Ghela</a>* <a href="/davidjwadden/">David Wadden</a> <a href="/YejinChoinka/">Yejin Choi</a> 💫

🧵 [1/n]

thumb_up_off_alt165

chat_bubble_outline1

repeat40

shareShare

Siddharth Dalmia

@siddalmia05

9 months ago

Interesting and thoughtful work on tracing LLM hallucinations to pretraining corpora!!!

thumb_up_off_alt6

chat_bubble_outline0

repeat2

shareShare

Abhilasha Ravichander

@lasha_nlp

8 months ago

Want to know what training data has been memorized by models like GPT-4? We propose information-guided probes, a method to uncover memorization evidence in *completely black-box* models, without requiring access to 🙅‍♀️ Model weights 🙅‍♀️ Training data 🙅‍♀️ Token probabilities 🧵1/5

thumb_up_off_alt210

chat_bubble_outline4

repeat40

shareShare

Siddharth Dalmia

@siddalmia05

7 months ago

Started a new role at WaveForms AI, founded by Alexis Conneau and Coralie Lemaitre (waveforms.ai). I am excited to be working with a fantastic team of AI dreamers building the future of audio LLMs. Ready to give form to the coming wave of audio intelligence. 🔊🌊🧠

Started a new role at <a href="/WaveFormsAI/">WaveForms AI</a>, founded by <a href="/alex_conneau/">Alexis Conneau</a> and Coralie Lemaitre (waveforms.ai). I am excited to be working with a fantastic team of AI dreamers building the future of audio LLMs.

Ready to give form to the coming wave of audio intelligence. 🔊🌊🧠

thumb_up_off_alt71

chat_bubble_outline6

repeat5

shareShare

Abhilasha Ravichander

@lasha_nlp

4 months ago

Life update: I’m excited to share that I’ll be starting as faculty at the Max Planck Institute for Software Systems(Max Planck Institute for Software Systems) this Fall!🎉 I’ll be recruiting PhD students in the upcoming cycle, as well as research interns throughout the year: lasharavichander.github.io/contact.html

Life update: I’m excited to share that I’ll be starting as faculty at the Max Planck Institute for Software Systems(<a href="/mpi_sws_/">Max Planck Institute for Software Systems</a>) this Fall!🎉

I’ll be recruiting PhD students in the upcoming cycle, as well as research interns throughout the year: lasharavichander.github.io/contact.html

thumb_up_off_alt505

chat_bubble_outline75

repeat43

shareShare

Abhilasha Ravichander

@lasha_nlp

3 months ago

Super thrilled that HALoGEN, our study of LLM hallucinations and their potential origins in training data, received an Outstanding Paper Award at ACL! Joint work w/i Shrusti Ghela*, and David Wadden Yejin Choi 💫

thumb_up_off_alt178

chat_bubble_outline23

repeat20

shareShare