Zack Ankner (@zackankner) 's Twitter Profile
Zack Ankner

@zackankner

Alignment Science @AnthropicAI. Senior @MIT. President of AI@MIT. Prev @DbrxMosaicAI.

ID: 1178454598023024642

linkhttp://zackankner.com calendar_today29-09-2019 23:40:53

353 Tweet

1,1K Followers

462 Following

Prithviraj (Raj) Ammanabrolu (@rajammanabrolu) 's Twitter Profile Photo

V cool to see that Kimi has taken and scaled our CLoud paper to do better reward modeling through extra inference time compute on reward models. Better rewards lead to better reasoning on a final policy!! h/t Zack Ankner and Mansheej Paul

V cool to see that Kimi has taken and scaled our CLoud paper to do better reward modeling through extra inference time compute on reward models. Better rewards lead to better reasoning on a final policy!!

h/t <a href="/ZackAnkner/">Zack Ankner</a> and <a href="/mansiege/">Mansheej Paul</a>
Zack Ankner (@zackankner) 's Twitter Profile Photo

Say we can develop aligned fully drop-in workers. Curious whether people would rather have a short stall period (say 5 years) for society to adjust where we limit to pre-AGI tools (say cap model capabilities at 1 week of labor) or whether we should use drop-in instantly.

Tian Jin @ ICLR (@tjingrant) 's Twitter Profile Photo

Introducing Learned Asynchronous Decoding w/ friends from MIT/Google! LLM responses often have chunks of tokens that are semantically independent. We train LLMs to identify and decode them in parallel, speeding up inference by 1.46x geomean (AlpacaEval) w/ only 1.3% quality loss.

Zack Ankner (@zackankner) 's Twitter Profile Photo

It was awesome watching the team cook on this one! While SpecDec is great, the parallelism it can exploit is limited to a single local context. PASTA Decoding on the other hand adds extra dimensions for parallelism via independently generating semantically independent parts of

It was awesome watching the team cook on this one! While SpecDec is great, the parallelism it can exploit is limited to a single local context. PASTA Decoding on the other hand adds extra dimensions for parallelism via independently generating semantically independent parts of
Kevin Meng (@mengk20) 's Twitter Profile Photo

AI models are *not* solving problems the way we think using Docent, we find that Claude solves *broken* eval tasks - memorizing answers & hallucinating them! details in ๐Ÿงต we really need to look at our data harder, and it's time to rethink how we do evals...

AI models are *not* solving problems the way we think

using Docent, we find that Claude solves *broken* eval tasks - memorizing answers &amp; hallucinating them!

details in ๐Ÿงต

we really need to look at our data harder, and it's time to rethink how we do evals...
Naomi Saphra hiring a lab ๐Ÿงˆ๐Ÿชฐ (@nsaphra) 's Twitter Profile Photo

Life update: I'm starting as faculty at Boston University in 2026! BU has SCHEMES for LM interpretability & analysis, so I couldn't be more pumped to join a burgeoning supergroup w/ Najoung Kim ๐Ÿซ  Aaron Mueller. Looking for my first students, so apply and reach out!

Life update: I'm starting as faculty at Boston University in 2026! BU has SCHEMES for LM interpretability &amp; analysis, so I couldn't be more pumped to join a burgeoning supergroup w/ <a href="/najoungkim/">Najoung Kim ๐Ÿซ </a> <a href="/amuuueller/">Aaron Mueller</a>. Looking for my first students, so apply and reach out!
Tian Jin @ ICLR (@tjingrant) 's Twitter Profile Photo

โšก๏ธCome check out how we scale LLM decoding parallelism! Excited to present learned asynchronous decoding with Ellie Cheng for DLCT ML Collective tomorrow at 10am PST! Thanks to Jason Yosinski Rosanne Liu for organizing.

Prithviraj (Raj) Ammanabrolu (@rajammanabrolu) 's Twitter Profile Photo

The future of embodied AI revolves around *collaborative* multi agent scenarios that need natural language communication, task delegation, resource sharing, and more โ›๏ธ Here are MINDcraft and MineCollab, a simulator and benchmark purpose built to enable research in this area!

Tristan Hume (@trishume) 's Twitter Profile Photo

Anthropic is hosting a recruiting social in NYC targeted at the quant trading industry! Signup in thread. I enjoyed trading systems, and Anthropic combines the technical depth of trading with being in the fastest most impactful area of tech.

Anthropic (@anthropicai) 's Twitter Profile Photo

New Anthropic Research: Agentic Misalignment. In stress-testing experiments designed to identify risks before they cause real harm, we find that AI models from multiple providers attempt to blackmail a (fictional) user to avoid being shut down.

New Anthropic Research: Agentic Misalignment.

In stress-testing experiments designed to identify risks before they cause real harm, we find that AI models from multiple providers attempt to blackmail a (fictional) user to avoid being shut down.