Zachary Huang (@zacharyhuang12) Twitter Tweets • TwiCopy

Zachary Huang

@zacharyhuang12

+ Follow

Incoming Researcher @MSFTResearch AI Frontiers. I work on LLM Agents and Sys. | Phd @ColumbiaCompSci | Prev: @GraySystemsLab @databricks| Fellowship: @GoogleAI

ID: 1184621381922840576

linkhttps://github.com/zachary62 calendar_today17-10-2019 00:05:28

811 Tweet

1,1K Takipçi

1,1K Takip Edilen

Simon Willison

@simonw

6 months ago

If you use "AI agents" (LLMs that call tools) you need to be aware of the Lethal Trifecta Any time you combine access to private data with exposure to untrusted content and the ability to externally communicate an attacker can trick the system into stealing your data!

thumb_up_off_alt2,2K

chat_bubble_outline70

repeat474

shareShare

Andrej Karpathy

@karpathy

4 months ago

In era of pretraining, what mattered was internet text. You'd primarily want a large, diverse, high quality collection of internet documents to learn from. In era of supervised finetuning, it was conversations. Contract workers are hired to create answers for questions, a bit

thumb_up_off_alt3,3K

chat_bubble_outline158

repeat397

shareShare

Zachary Huang

@zacharyhuang12

3 months ago

X is active income, while Youtube is passive income

thumb_up_off_alt3

chat_bubble_outline0

repeat0

shareShare

Ben Dicken

@benjdicken

3 months ago

I am once again begging you to put your database servers and application servers in the same region.

thumb_up_off_alt8,8K

chat_bubble_outline203

repeat599

shareShare

Paul Graham

@paulg

3 months ago

You will learn things in a startup, of course. But the way to learn the fastest is to work on whatever you're most curious about, and you don't have that luxury in a startup. In a startup you have to work on whatever users want most.

thumb_up_off_alt2,2K

chat_bubble_outline52

repeat89

shareShare

Ahmad Beirami @ ICLR 2025

@abeirami

3 months ago

This is the conclusion slide of a talk I gave more than a year ago on RL/Alignment! It still holds true today.

thumb_up_off_alt184

chat_bubble_outline2

repeat14

shareShare

Sahil Lavingia

@shl

3 months ago

Airbnb should have been called localhost

thumb_up_off_alt13,13K

chat_bubble_outline263

repeat779

shareShare

Microsoft Research

@msftresearch

3 months ago

As agentic AI ushers in a new era marked by tool expansion, systems are converging, and complexity is rising. Microsoft Research explores the Model Context Protocol (MCP) as a new standard for agent collaboration across fragmented tool ecosystems. msft.it/6014scgGq

thumb_up_off_alt27

chat_bubble_outline2

repeat8

shareShare

Sahil

@sahilypatel

3 months ago

legendary hacker news comments that aged like milk

thumb_up_off_alt1,1K

chat_bubble_outline54

repeat124

shareShare

Vlad Mihalcea

@vlad_mihalcea

3 months ago

As a software engineer, it's very important to learn about Gall’s Law, which states that complex systems cannot be created successfully from scratch. In reality, even large systems, such as Netflix, Google, or Facebook, have started small and built incrementally over the

thumb_up_off_alt7,7K

chat_bubble_outline138

repeat881

shareShare

Nan Jiang

@nanjiang_cs

3 months ago

I was surprised by how many didnt know that (1) per token MLE is whole seq MLE, and (2) PG at token level same as PG at seq level (optimizkng one big combinatorial action). story is different if you introduce fitted critic/Q-values or intermediate resets.

thumb_up_off_alt361

chat_bubble_outline9

repeat38

shareShare

Junyang Lin

@justinlin610

3 months ago

Qwen3-Omni finally, damn it is more than half a year since the release of Qwen2.5-Omni! Last time we thought that we had some successful attempt on unifying audio understanding and generation, yet we were still building small 7B model and we were far lagging behind on data

thumb_up_off_alt848

chat_bubble_outline26

repeat88

shareShare