Aswathy Ajith (@aswathy__ajith) Twitter Tweets • TwiCopy

Marc Marone

3 years ago

We’ve all wondered “is this text already in my LLM training data? 🤔” Want a fast and compact way to check data membership? Introducing Data Portraits 🖼️ Work w/ Benjamin Van Durme Demo: dataportraits.org

thumb_up_off_alt277

chat_bubble_outline5

repeat75

shareShare

Jason Wei

@_jasonwei

3 years ago

I’m hearing chatter of PhD students not knowing what to work on. My take: as LLMs are deployed IRL, the importance of studying how to use them will increase. Some good directions IMO (no training): 1. prompting 2. evals 3. LM interfaces 4. safety 5. understanding LMs 6. emergence

thumb_up_off_alt1,1K

chat_bubble_outline50

repeat276

shareShare

Allyson Ettinger

@allysonettinger

3 years ago

Happy to share the news of this Best Paper Award for the COMPS paper (arxiv.org/abs/2210.01963). Congratulations Kanishka!

thumb_up_off_alt29

chat_bubble_outline0

repeat6

shareShare

Globus Labs

@labsglobus

3 years ago

Hong, Aswathy, Greg, Eamon, Kyle and Ian's paper "The Diminishing Returns of Masked Language Models to Science”, has been accepted to the Findings of ACL 2023 (doi.org/10.48550/arXiv…)!📜

thumb_up_off_alt4

chat_bubble_outline0

repeat1

shareShare

Mourad Heddaya

@mouradheddaya

2 years ago

How does natural language shape bilateral bargaining? To study this, we contribute a dataset of spoken natural language (NL) negotiations & a control without NL. We find that NL promotes cooperation. Being patient and reactive is an effective strategy. 📄arxiv.org/abs/2306.07117

thumb_up_off_alt27

chat_bubble_outline1

repeat12

shareShare

Kanishka Misra 🌊

@kanishkamisra

2 years ago

Our paper on analyzing language modeling acceptability judgments with systematically manipulated contexts was recognized as an outstanding paper—thanks so much to the reviewers and the best paper award committee!

thumb_up_off_alt155

chat_bubble_outline14

repeat11

shareShare

Jason Wei

@_jasonwei

2 years ago

In the past weeks I received many questions (from undergrads especially) about AI research, so I'm putting together a "Ask Me Anything" doc. Add any questions to the doc, I'll answer all of them: docs.google.com/document/d/1QR… Yes, I'll actually answer them all, because writing answers

thumb_up_off_alt729

chat_bubble_outline17

repeat142

shareShare

Mansi Sakarvadia

@mansi__s

2 years ago

Excited to be attending #EMNLP2023 in 🇸🇬! Will be giving an oral presenting my paper about localizing and rectifying multi-hop reasoning failures in LMs at BlackboxNLP on Dec 7th at (2:00-3:30 pm). Please stop by and say 👋/DM if you want to connect! msakarvadia.github.io/memory_injecti…

thumb_up_off_alt7

chat_bubble_outline0

repeat3

shareShare

Aryan Shrivastava

@aryan_shri123

5 months ago

🤫Jailbreak prompts make aligned LMs produce harmful responses.🤔But is that info linearly decodable? ↗️We show many refused concepts are linearly represented, sometimes persist through instruction-tuning, and may also shape downstream behavior❗ arxiv.org/abs/2507.00239 🧵1/

thumb_up_off_alt19

chat_bubble_outline1

repeat8

shareShare

Ari Holtzman

@universeinanegg

5 months ago

Prompting is our most successful tool for exploring LLMs, but the term evokes eye-rolls and grimaces from scientists. Why? Because prompting as scientific inquiry has become conflated with prompt engineering. This is holding us back. 🧵and new paper: arxiv.org/abs/2507.00163

thumb_up_off_alt160

chat_bubble_outline6

repeat30

shareShare

Dang Nguyen

@divingwithorcas

2 months ago

HR Simulator™: a game where you gaslight, deflect, and “let’s circle back” your way to victory. Every email a boss fight, every “per my last message” a critical hit… or maybe you just overplayed your hand 🫠 Can you earn Enlightened Bureaucrat status? (link below)

thumb_up_off_alt35

chat_bubble_outline2

repeat11

shareShare

Ari Holtzman

@universeinanegg

2 months ago

For those who missed it, we just releaaed a little LLM-backed game called HR Simulator™ You play an intern ghostwriting emails for your boss. It’s like you’re stuck in corporate email hell…and you’re the devil 😈 link and an initial answer to “WHY WOULD YOU DO THIS?” below

thumb_up_off_alt63

chat_bubble_outline3

repeat19

shareShare

Xiaoyan Bai

@elenal3ai

2 months ago

An interesting collaboration and mentoring experience -- this time with AIs🤖. Check out the paper written by our AI scientists: openreview.net/forum?id=sVaRg…

thumb_up_off_alt7

chat_bubble_outline1

repeat2

shareShare

Xiaoyan Bai

@elenal3ai

a month ago

❓ Does an LLM know thyself? 🪞 Humans pass the mirror test at ~18 months 👶 But what about LLMs? Can they recognize their own writing — or even admit authorship at all? In our new paper, we put 10 state-of-the-art models to the test. Read on 👇 1/n 🧵

thumb_up_off_alt45

chat_bubble_outline2

repeat18

shareShare

Ari Holtzman

@universeinanegg

a month ago

Redesigning an LLM course that goes from perceptrons 👁️ to the geometry of the residual stream 📐! Heavily weighting in-class assessments, due to AI tools. (2024 was project-focused) Would be grateful for any exams/etc. folks can share, this is new territory for me! Please♻️!!

thumb_up_off_alt38

chat_bubble_outline2

repeat11

shareShare

Haokun Liu

@haokunliu5280

23 days ago

We're launching a weekly competition where the community decides which research ideas get implemented. Every week, we'll take the top 3 ideas from IdeaHub, run experiments with AI agents, and share everything: code, findings, all the successes and failures. It's completely free

thumb_up_off_alt33

chat_bubble_outline3

repeat19

shareShare