Zack Ankner (@zackankner) Twitter Tweets • TwiCopy

Zack Ankner

@zackankner

a year ago

Need to set a screen time limit for wandb

thumb_up_off_alt43

chat_bubble_outline1

repeat1

shareShare

Zack Ankner

@zackankner

a year ago

Aspartame maxxing

thumb_up_off_alt12

chat_bubble_outline0

repeat0

shareShare

V cool to see that Kimi has taken and scaled our CLoud paper to do better reward modeling through extra inference time compute on reward models. Better rewards lead to better reasoning on a final policy!! h/t Zack Ankner and Mansheej Paul

thumb_up_off_alt25

chat_bubble_outline1

repeat3

shareShare

Zack Ankner

@zackankner

a year ago

2 days, 18 cokes, 216 fluid ounces later, the second layer has been assembled .

thumb_up_off_alt22

chat_bubble_outline0

repeat1

shareShare

Zack Ankner

@zackankner

a year ago

If a turing machine writes to tape and no one is around to read the tape, is it really turing complete

thumb_up_off_alt8

chat_bubble_outline1

repeat0

shareShare

Zack Ankner

@zackankner

10 months ago

If I got aspartame poisoning, I wouldn’t tell anyone but there would be signs

thumb_up_off_alt13

chat_bubble_outline0

repeat0

shareShare

Zack Ankner

@zackankner

10 months ago

Say we can develop aligned fully drop-in workers. Curious whether people would rather have a short stall period (say 5 years) for society to adjust where we limit to pre-AGI tools (say cap model capabilities at 1 week of labor) or whether we should use drop-in instantly.

thumb_up_off_alt3

chat_bubble_outline2

repeat0

shareShare

Zack Ankner

@zackankner

10 months ago

Only useful benchmark at this point is cursor usage rate

thumb_up_off_alt38

chat_bubble_outline1

repeat2

shareShare

Prithviraj (Raj) Ammanabrolu

@rajammanabrolu

10 months ago

Aligning economic incentives with the long term necessity of human AI colab is one of the hardest challenges of our time

thumb_up_off_alt10

chat_bubble_outline1

repeat5

shareShare

Tian Jin @ ICLR

@tjingrant

9 months ago

Introducing Learned Asynchronous Decoding w/ friends from MIT/Google! LLM responses often have chunks of tokens that are semantically independent. We train LLMs to identify and decode them in parallel, speeding up inference by 1.46x geomean (AlpacaEval) w/ only 1.3% quality loss.

thumb_up_off_alt64

chat_bubble_outline4

repeat13

shareShare

Zack Ankner

@zackankner

9 months ago

It was awesome watching the team cook on this one! While SpecDec is great, the parallelism it can exploit is limited to a single local context. PASTA Decoding on the other hand adds extra dimensions for parallelism via independently generating semantically independent parts of

thumb_up_off_alt10

chat_bubble_outline0

repeat2

shareShare

Kevin Meng

@mengk20

8 months ago

AI models are *not* solving problems the way we think using Docent, we find that Claude solves *broken* eval tasks - memorizing answers & hallucinating them! details in 🧵 we really need to look at our data harder, and it's time to rethink how we do evals...

thumb_up_off_alt1,1K

chat_bubble_outline17

repeat107

shareShare

Naomi Saphra hiring a lab 🧈🪰

@nsaphra

8 months ago

Life update: I'm starting as faculty at Boston University in 2026! BU has SCHEMES for LM interpretability & analysis, so I couldn't be more pumped to join a burgeoning supergroup w/ Najoung Kim 🫠 Aaron Mueller. Looking for my first students, so apply and reach out!

thumb_up_off_alt430

chat_bubble_outline46

repeat23

shareShare

Tian Jin @ ICLR

@tjingrant

8 months ago

⚡️Come check out how we scale LLM decoding parallelism! Excited to present learned asynchronous decoding with Ellie Cheng for DLCT ML Collective tomorrow at 10am PST! Thanks to Jason Yosinski Rosanne Liu for organizing.

thumb_up_off_alt16

chat_bubble_outline0

repeat6

shareShare

Prithviraj (Raj) Ammanabrolu

@rajammanabrolu

7 months ago

The future of embodied AI revolves around *collaborative* multi agent scenarios that need natural language communication, task delegation, resource sharing, and more ⛏️ Here are MINDcraft and MineCollab, a simulator and benchmark purpose built to enable research in this area!

thumb_up_off_alt207

chat_bubble_outline5

repeat40

shareShare

Tristan Hume

@trishume

7 months ago

Anthropic is hosting a recruiting social in NYC targeted at the quant trading industry! Signup in thread. I enjoyed trading systems, and Anthropic combines the technical depth of trading with being in the fastest most impactful area of tech.

thumb_up_off_alt841

chat_bubble_outline25

repeat35

shareShare

Anthropic

@anthropicai

6 months ago

New Anthropic Research: Agentic Misalignment. In stress-testing experiments designed to identify risks before they cause real harm, we find that AI models from multiple providers attempt to blackmail a (fictional) user to avoid being shut down.

thumb_up_off_alt3,3K

chat_bubble_outline165

repeat573

shareShare