Jacob Andreas (@jacobandreas) 's Twitter Profile
Jacob Andreas

@jacobandreas

Teaching computers to read.
Assoc. prof @MITEECS / @MIT_CSAIL / @NLP_MIT (he/him).
lingo.csail.mit.edu
web.mit.edu/jda/www

ID: 2210861

Joined: 25-03-2007 21:44:59

2.2K Tweets

18.18K Followers

949 Following

Leshem Choshen C U @ ICLR 🤖🤗 (@lchoshen) 's Twitter Profile Photo

Human feedback is critical for aligning LLMs, so why don’t we collect it in the open ecosystem?🧐 We (15 orgs) gathered the key issues and next steps. Envisioning a community-driven feedback platform, like Wikipedia alphaxiv.org/abs/2408.16961 🧵

Transluce (@transluceai) 's Twitter Profile Photo

To interpret AI benchmarks, we need to look at the data. Top-level numbers don't mean what you think: there may be broken tasks, unexpected behaviors, or near-misses. We're introducing Docent to accelerate analysis of AI agent transcripts. It can spot surprises in seconds. 🧵👇

Naomi Saphra hiring a lab 🧈🪰 (@nsaphra) 's Twitter Profile Photo

Life update: I'm starting as faculty at Boston University in 2026! BU has SCHEMES for LM interpretability & analysis, so I couldn't be more pumped to join a burgeoning supergroup w/ Najoung Kim 🫠 Aaron Mueller. Looking for my first students, so apply and reach out!

MIT NLP (@nlp_mit) 's Twitter Profile Photo

Hello everyone! We are quite a bit late to the twitter party, but welcome to the MIT NLP Group account! follow along for the latest research from our labs as we dive deep into language, learning, and logic 🤖📚🧠

Bairu Hou (@hou_bairu) 's Twitter Profile Photo

1/ Long chain-of-thought (CoT) reasoning boosts LLM performance, but with a computational overhead. Check out our new paper, ThinkPrune, where we explore a simple question: To what extent can we cut the reasoning length while keeping the quality? We show that by simply adding a hard

Ġabe Ġrand (@gabe_grand) 's Twitter Profile Photo

New preprint is live! Tweet thread coming 🚧🔜
📅 Excited to present this work in-person:
- 4/11: Poster at New England NLP (NENLP) 2025, Yale University (tomorrow!)
- 4/27: Oral talk at VerifAI@ICLR 2025, Singapore

Ben Lipkin (@ben_lipkin) 's Twitter Profile Photo

New preprint on controlled generation from LMs! I'll be presenting at NENLP tomorrow 12:50-2:00pm Longer thread coming soon :)

Ġabe Ġrand (@gabe_grand) 's Twitter Profile Photo

Tackling complex problems with LMs requires search/planning, but how should test-time compute be structured? Introducing Self-Steering, a new meta-reasoning framework where LMs coordinate their own inference procedures by writing code!

Ekin Akyürek (@akyurekekin) 's Twitter Profile Photo

✨ Big life updates ✨
- Afra Feyza Akyürek and I welcomed our baby!
- Successfully defended my PhD and graduated from MIT 🎓
- Joined OpenAI 🍓
Excited for what's next!

Shikhar (@shikharmurty) 's Twitter Profile Photo

New #NAACL2025 paper! 🚨 Transformer LMs are data hungry, we propose a new auxiliary loss function (TreeReg) to fix that. TreeReg takes bracketing decisions from syntax trees and turns them into orthogonality constraints on span representations. ✅ Boosts pre-training data

MIT NLP (@nlp_mit) 's Twitter Profile Photo

MIT NLP @ ICLR 2025 - catch Mehul Damani at poster 219, Thursday 3PM to chat about "Learning How Hard to Think: Input Adaptive Allocation of LM Computation"!

Cédric (@cedcolas) 's Twitter Profile Photo

i just got an art grant from the council for the arts at MIT! *Tangible Dreams* will let visitors experiment and play with a physical neural network generating images real-time—by twisting knobs and switches, by reconnecting nodes together Arts at MIT

Laura Ruis (@lauraruis) 's Twitter Profile Photo

Excited to announce that this fall I'll be joining Jacob Andreas's amazing lab at MIT for a postdoc to work on interp. for reasoning (with Ev (like in 'evidence', not Eve) Fedorenko 🇺🇦 🤯 among others). Cannot wait to think more about this direction in such a dream academic context!

Stanford NLP Group (@stanfordnlp) 's Twitter Profile Photo

For this week’s NLP Seminar, we are thrilled to host Jacob Andreas to talk about “Just Asking Questions” When: 5/15 Thurs 11am PT Non-Stanford affiliates registration form: forms.gle/svy5q5uu7anHw7…
