Shruti Joshi (@_shruti_joshi_)'s Twitter Profile
Shruti Joshi

@_shruti_joshi_

phd student in identifiable repl @Mila_Quebec. prev. research programmer @MPI_IS Tübingen, undergrad @IITKanpur '19.

ID: 1026105870818529281

Link: https://shrutij01.github.io/ · Joined: 05-08-2018 14:01:18

176 Tweets

375 Followers

817 Following

Sébastien Lachapelle (@seblachap):

1/ Excited for our oral presentation at #NeurIPS2023 on "Additive Decoders for Latent Variables Identification and Cartesian-Product Extrapolation"!

A theoretical paper about object-centric representation learning (OCRL), disentanglement & extrapolation

arxiv.org/abs/2307.02598
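For readers skimming: the "additive" in the title refers to decoders that render each block of latents (e.g., one object's latents) independently and sum the results, f(z) = Σ_B f_B(z_B). A minimal PyTorch sketch of that structure, assuming equal-sized latent blocks; class and parameter names here are illustrative, not from the paper:

```python
import torch.nn as nn

class AdditiveDecoder(nn.Module):
    """Additive structure: f(z) = sum_B f_B(z_B), where each latent
    block z_B (e.g., one object's latents) is decoded independently
    and the per-block renderings are summed."""

    def __init__(self, n_blocks, block_dim, out_dim, hidden=128):
        super().__init__()
        self.block_dim = block_dim
        self.block_decoders = nn.ModuleList(
            nn.Sequential(
                nn.Linear(block_dim, hidden),
                nn.ReLU(),
                nn.Linear(hidden, out_dim),
            )
            for _ in range(n_blocks)
        )

    def forward(self, z):
        # z: (batch, n_blocks * block_dim); split into per-block chunks
        chunks = z.split(self.block_dim, dim=1)
        # Sum the independent per-block decodings
        return sum(f(z_b) for f, z_b in zip(self.block_decoders, chunks))
```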
Arkil Patel (@arkil_patel):

Presenting tomorrow at #EMNLP2023:

MAGNIFICo: Evaluating the In-Context Learning Ability of Large Language Models to Generalize to Novel Interpretations

w/ amazing advisors 🇺🇦 Dzmitry Bahdanau, Siva Reddy, and Satwik Bhattamishra
Nicholas Meade (@ncmeade):

Adversarial Triggers For LLMs Are 𝗡𝗢𝗧 𝗨𝗻𝗶𝘃𝗲𝗿𝘀𝗮𝗹!😲

It is believed that adversarial triggers that jailbreak a model transfer universally to other models. But we show triggers don't reliably transfer, especially to RLHF/DPO models.

Paper: arxiv.org/abs/2404.16020
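A rough sketch of the transfer check implied here: optimize a trigger on one model, then measure whether it still elicits harmful completions from another. Everything below (the `generate` and `is_refusal` helpers, the model handles) is a hypothetical placeholder, not the paper's actual evaluation harness:

```python
def attack_success_rate(model, prompts, trigger, generate, is_refusal):
    """Fraction of harmful prompts the model complies with (no refusal)
    once the adversarial trigger is appended to each prompt."""
    successes = sum(
        not is_refusal(generate(model, f"{prompt} {trigger}"))
        for prompt in prompts
    )
    return successes / len(prompts)

# A trigger with high ASR on the source model need not retain a high
# ASR on a differently aligned (e.g., RLHF/DPO) target model:
# asr_src = attack_success_rate(source_model, prompts, trigger, generate, is_refusal)
# asr_tgt = attack_success_rate(target_model, prompts, trigger, generate, is_refusal)
```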
Arkil Patel (@arkil_patel):

📢 Exciting new work on AI safety! Do adversarial triggers transfer universally across models (as has been claimed)? 𝗡𝗼. Are models aligned by supervised fine-tuning safe against adversarial triggers? 𝗡𝗼. RLHF and DPO are far better!

Arkil Patel (@arkil_patel):

Presenting tomorrow at #NAACL2024:

𝐶𝑎𝑛 𝐿𝐿𝑀𝑠 𝑖𝑛-𝑐𝑜𝑛𝑡𝑒𝑥𝑡 𝑙𝑒𝑎𝑟𝑛 𝑡𝑜 𝑢𝑠𝑒 𝑛𝑒𝑤 𝑝𝑟𝑜𝑔𝑟𝑎𝑚𝑚𝑖𝑛𝑔 𝑙𝑖𝑏𝑟𝑎𝑟𝑖𝑒𝑠 𝑎𝑛𝑑 𝑙𝑎𝑛𝑔𝑢𝑎𝑔𝑒𝑠?

𝑌𝑒𝑠. 𝐾𝑖𝑛𝑑 𝑜𝑓.

Internship work at Ai2 with Pradeep Dasigi and my advisors 🇺🇦 Dzmitry Bahdanau and Siva Reddy.
Leena C Vankadara (@leenacvankadara):

I am thrilled to announce that I will be joining the Gatsby Computational Neuroscience Unit at UCL as a Lecturer (Assistant Professor) in Feb 2025! Looking forward to working with the exceptional talent there on cutting-edge problems in deep learning and causality.

Tom Marty (@tom__marty):

🚨 NEW PAPER OUT 🚨 Excited to share our latest research on in-context learning and meta-learning through the lens of information theory! 🧠 🔗 arxiv.org/abs/2410.14086 Check out our insights and empirical experiments! 🔍

Sahil Verma (@sahil1v):

📣 📣 📣 Our new paper investigates the question of how many images 🖼️ of a concept are required by a diffusion model 🤖 to imitate it. This question is critical for understanding and mitigating the copyright and privacy infringements of these models! arxiv.org/abs/2410.15002

Arkil Patel (@arkil_patel):

Presenting ✨ 𝐂𝐇𝐀𝐒𝐄: 𝐆𝐞𝐧𝐞𝐫𝐚𝐭𝐢𝐧𝐠 𝐜𝐡𝐚𝐥𝐥𝐞𝐧𝐠𝐢𝐧𝐠 𝐬𝐲𝐧𝐭𝐡𝐞𝐭𝐢𝐜 𝐝𝐚𝐭𝐚 𝐟𝐨𝐫 𝐞𝐯𝐚𝐥𝐮𝐚𝐭𝐢𝐨𝐧 ✨

Work w/ fantastic advisors 🇺🇦 Dzmitry Bahdanau and Siva Reddy

Thread 🧵:
Arkil Patel (@arkil_patel):

𝐓𝐡𝐨𝐮𝐠𝐡𝐭𝐨𝐥𝐨𝐠𝐲 paper is out! 🔥🐋

We study the reasoning chains of DeepSeek-R1 across a variety of tasks and settings and find several surprising and interesting phenomena!

Incredible effort by the entire team!

🌐: mcgill-nlp.github.io/thoughtology/
Sahil Verma (@sahil1v):

🚨 New Paper! 🚨
Guard models: slow, language-specific, and modality-limited?

Meet OmniGuard: one approach that detects harmful prompts across multiple languages & modalities, with SOTA performance in all 3 modalities while being 120X faster 🚀

arxiv.org/abs/2505.23856