Samyak (@sams_jain) 's Twitter Profile
Samyak

@sams_jain

Research Fellow @MSFTResearch, Previous @_FiveAI, @kasl_ai, @val_iisc, CS @IITBHU_Varanasi.

Interested in foundations of AI and AI Safety.

ID: 1355198417673150469

Link: https://samyakjain0112.github.io/ | Joined: 29-01-2021 16:57:47

102 Tweets

252 Followers

803 Following

Abhishek Panigrahi (@abhishek_034) 's Twitter Profile Photo

Progressive distillation, where a student model learns from multiple checkpoints of the teacher, has been shown to improve the student–but why? We show it induces an implicit curriculum that accelerates training. Work with Bingbin Liu, Sadhika Malladi, Andrej Risteski, Surbhi Goel

Ekdeep Singh Lubana (@ekdeepl) 's Twitter Profile Photo

Paper alert—accepted as a NeurIPS *Spotlight*!🧵👇 We build on our past work relating emergence to task compositionality and analyze the *learning dynamics* of such tasks: we find there exist latent interventions that can elicit them well before input prompting works! 🤯

Usman Anwar (@usmananwar391) 's Twitter Profile Photo

Transformers are REALLY good at in-context learning (ICL); but do they learn ‘adversarially robust’ ICL algorithms? We study this and much more in our new paper! 🧵

P Shravan Nayak (@pshravannayak) 's Twitter Profile Photo

Excited to be at #EMNLP2024! 🎉 Join my talk on CulturalVQA, a benchmark testing Vision Language Models’ grasp of cultural understanding. Let’s see if VLMs truly capture global perspectives—chat after! 🗓️ Nov 12 (Tue), 4:15-4:30 PM 📍 Flagler Paper: arxiv.org/abs/2407.10920

Samyak (@sams_jain) 's Twitter Profile Photo

I'll be at NeurIPS Conference, sharing my work on understanding safety fine-tuning and jailbreaks. Visit our poster on Wed 11, Session 2 (#3306). I'm also super excited to discuss my current work at Microsoft Research on understanding the lottery ticket hypothesis. Please reach out to chat!

Satwik Bhattamishra (@satwik1729) 's Twitter Profile Photo

Excited to head to NeurIPS Conference today! I'll be presenting our work on the representational capabilities of Transformers and RNNs/SSMs. If you're interested in meeting up to discuss research or chat, feel free to reach out via DM or email!

Samyak (@sams_jain) 's Twitter Profile Photo

Very interesting work with several real-world applications, including designing natural jailbreaks and augmenting safety fine-tuning datasets.

Ekdeep Singh Lubana (@ekdeepl) 's Twitter Profile Photo

Paper alert––*Awarded best paper* at NeurIPS workshop on Foundation Model Interventions! 🧵👇 We analyze the (in)abilities of SAEs by relating them to the field of disentangled rep. learning, where limitations of AE based interpretability protocols have been well established!🤯

iKDD (@ikdd_news) 's Twitter Profile Photo

IKDD congratulates Sravanti Addepalli (IISc Bangalore) for enhancing the robustness of Deep Neural Networks against adversarial attacks and distribution shifts while addressing practical deployment challenges. Preethi Jyothi, Manish Gupta, Amith Singhee

Andrew Lee (@a_jy_l) 's Twitter Profile Photo

New paper 🥳🚨 Interested in inference-time scaling? In-context Learning? Mech Interp? LMs can solve novel in-context tasks with sufficient examples (longer contexts). Why? Because they dynamically form *in-context representations*! 1/N

Tim Rocktäschel (@_rockt) 's Twitter Profile Photo

Proud to announce that Dr Robert Kirk defended his PhD thesis titled "Understanding and Evaluating Generalisation for Superhuman AI Systems" last week 🥳. Massive thanks to Roger Grosse and Sebastian Riedel (@riedelcastro@sigmoid.social) for examining! As is customary, Rob received a personal mortarboard from

Samyak (@sams_jain) 's Twitter Profile Photo

This work from my friend Pranav Nair seems very interesting! I wonder how it relates to recent work on scaling laws for precision: arxiv.org/pdf/2411.04330. Does it still follow a similar trend? I guess for 2-bit quantization it breaks the expected trend.

Ekdeep Singh Lubana (@ekdeepl) 's Twitter Profile Photo

New paper–Accepted at #ICLR2025 and also my last PhD paper! 🧑‍🎓🧵👇 We propose a novel model of how emergent learning curves show up in neural nets’ training by making a connection to the theory of graph percolation!

Sachin Yadav (@sachinyv) 's Twitter Profile Photo

✨New Paper: Presenting Interleaved Gibbs Diffusion (IGD), a novel generative framework for mixed continuous-discrete data, focusing on constrained generation. From 3-SAT and molecule design to layout generation, IGD advances diffusion models by capturing complex inter-variable

Andrew Lee (@a_jy_l) 's Twitter Profile Photo

🚨New preprint! How do reasoning models verify their own CoT? We reverse-engineer LMs and find critical components and subspaces needed for self-verification! 1/n

Ekdeep Singh Lubana (@ekdeepl) 's Twitter Profile Photo

🚨 New paper alert! The linear representation hypothesis (LRH) argues that concepts are encoded as a **sparse sum of orthogonal directions**, motivating interpretability tools like SAEs. But what if some concepts don’t fit that mold? Would SAEs capture them? 🤔 1/11