Jacob Andreas (@jacobandreas) Twitter Tweets • TwiCopy

Leshem Choshen C U @ ICLR 🤖🤗

a year ago

Human feedback is critical for aligning LLMs, so why don’t we collect it in the open ecosystem?🧐 We (15 orgs) gathered the key issues and next steps. Envisioning a community-driven feedback platform, like Wikipedia. alphaxiv.org/abs/2408.16961 🧵

Likes: 192 | Replies: 2 | Reposts: 50

Transluce

@transluceai

9 months ago

To interpret AI benchmarks, we need to look at the data. Top-level numbers don't mean what you think: there may be broken tasks, unexpected behaviors, or near-misses. We're introducing Docent to accelerate analysis of AI agent transcripts. It can spot surprises in seconds. 🧵👇

Likes: 330 | Replies: 9 | Reposts: 66

Naomi Saphra hiring a lab 🧈🪰

@nsaphra

9 months ago

Life update: I'm starting as faculty at Boston University in 2026! BU has SCHEMES for LM interpretability & analysis, so I couldn't be more pumped to join a burgeoning supergroup w/ Najoung Kim 🫠 Aaron Mueller. Looking for my first students, so apply and reach out!

Likes: 430 | Replies: 46 | Reposts: 23

MIT NLP

@nlp_mit

9 months ago

Hello everyone! We are quite a bit late to the Twitter party, but welcome to the MIT NLP Group account! Follow along for the latest research from our labs as we dive deep into language, learning, and logic 🤖📚🧠

Likes: 543 | Replies: 26 | Reposts: 52

Bairu Hou

@hou_bairu

8 months ago

1/ Long chain-of-thought (CoT) reasoning boosts LLM performance, but with a computational overhead. Check out our new paper, ThinkPrune, where we explore a simple question: to what extent can we cut the reasoning length while keeping the quality? We show that by simply adding a hard

Likes: 119 | Replies: 4 | Reposts: 19

AK

@_akhaliq

8 months ago

Self-Steering Language Models

Likes: 325 | Replies: 6 | Reposts: 49

Ġabe Ġrand

@gabe_grand

8 months ago

New preprint is live! Tweet thread coming 🚧🔜
📅 Excited to present this work in-person:
- 4/11: Poster at New England NLP (NENLP) 2025, Yale University (tomorrow!)
- 4/27: Oral talk at VerifAI@ICLR 2025, Singapore

Likes: 49 | Replies: 1 | Reposts: 11

Ben Lipkin

@ben_lipkin

8 months ago

New preprint on controlled generation from LMs! I'll be presenting at NENLP tomorrow, 12:50-2:00pm. Longer thread coming soon :)

Likes: 67 | Replies: 3 | Reposts: 11

Ġabe Ġrand

@gabe_grand

8 months ago

Tackling complex problems with LMs requires search/planning, but how should test-time compute be structured? Introducing Self-Steering, a new meta-reasoning framework where LMs coordinate their own inference procedures by writing code!

Likes: 108 | Replies: 7 | Reposts: 37

Ekin Akyürek

@akyurekekin

8 months ago

✨ Big life updates ✨
- Afra Feyza Akyürek and I welcomed our baby!
- Successfully defended my PhD and graduated from MIT 🎓
- Joined OpenAI 🍓
Excited for what's next!

Likes: 662 | Replies: 58 | Reposts: 12

Shikhar

@shikharmurty

8 months ago

New #NAACL2025 paper! 🚨 Transformer LMs are data hungry, so we propose a new auxiliary loss function (TreeReg) to fix that. TreeReg takes bracketing decisions from syntax trees and turns them into orthogonality constraints on span representations. ✅ Boosts pre-training data

Likes: 93 | Replies: 4 | Reposts: 22

MIT NLP

@nlp_mit

8 months ago

MIT NLP @ ICLR 2025 - catch Mehul Damani at poster 219, Thursday 3PM to chat about "Learning How Hard to Think: Input Adaptive Allocation of LM Computation"!

Likes: 17 | Replies: 0 | Reposts: 1

Cédric

@cedcolas

8 months ago

I just got an art grant from the Council for the Arts at MIT! *Tangible Dreams* will let visitors experiment and play with a physical neural network generating images in real time, by twisting knobs and switches and by reconnecting nodes together. Arts at MIT

Likes: 74 | Replies: 5 | Reposts: 12

Laura Ruis

@lauraruis

7 months ago

Excited to announce that this fall I'll be joining Jacob Andreas's amazing lab at MIT for a postdoc to work on interp. for reasoning (with Ev (like in 'evidence', not Eve) Fedorenko 🇺🇦 🤯 among others). Cannot wait to think more about this direction in such a dream academic context!

Likes: 484 | Replies: 45 | Reposts: 11

Stanford NLP Group

@stanfordnlp

7 months ago

For this week’s NLP Seminar, we are thrilled to host Jacob Andreas to talk about “Just Asking Questions” When: 5/15 Thurs 11am PT Non-Stanford affiliates registration form: forms.gle/svy5q5uu7anHw7…

Likes: 84 | Replies: 1 | Reposts: 12

INTERPLAY Workshop

@interplaywrkshp

7 months ago

🚨🚨 Studying the INTERPLAY of LMs' internals and behavior? Join our @colmweb.org workshop on comprehensively evaluating LMs. Deadline: June 23rd. CfP: shorturl.at/sBomu We're excited to see your work!! See you in Montréal 🇨🇦 #nlproc #interpretability

Likes: 7 | Replies: 1 | Reposts: 4