Zachary Huang (@zacharyhuang12) 's Twitter Profile
Zachary Huang

@zacharyhuang12

Incoming Researcher @MSFTResearch AI Frontiers. I work on LLM Agents and Sys. | Phd @ColumbiaCompSci | Prev: @GraySystemsLab @databricks| Fellowship: @GoogleAI

ID: 1184621381922840576

linkhttps://github.com/zachary62 calendar_today17-10-2019 00:05:28

811 Tweet

1,1K Takipçi

1,1K Takip Edilen

Simon Willison (@simonw) 's Twitter Profile Photo

If you use "AI agents" (LLMs that call tools) you need to be aware of the Lethal Trifecta Any time you combine access to private data with exposure to untrusted content and the ability to externally communicate an attacker can trick the system into stealing your data!

If you use "AI agents" (LLMs that call tools) you need to be aware of the Lethal Trifecta

Any time you combine access to private data with exposure to untrusted content and the ability to externally communicate an attacker can trick the system into stealing your data!
Andrej Karpathy (@karpathy) 's Twitter Profile Photo

In era of pretraining, what mattered was internet text. You'd primarily want a large, diverse, high quality collection of internet documents to learn from. In era of supervised finetuning, it was conversations. Contract workers are hired to create answers for questions, a bit

Paul Graham (@paulg) 's Twitter Profile Photo

You will learn things in a startup, of course. But the way to learn the fastest is to work on whatever you're most curious about, and you don't have that luxury in a startup. In a startup you have to work on whatever users want most.

Microsoft Research (@msftresearch) 's Twitter Profile Photo

As agentic AI ushers in a new era marked by tool expansion, systems are converging, and complexity is rising. Microsoft Research explores the Model Context Protocol (MCP) as a new standard for agent collaboration across fragmented tool ecosystems. msft.it/6014scgGq

As agentic AI ushers in a new era marked by tool expansion, systems are converging, and complexity is rising. Microsoft Research explores the Model Context Protocol (MCP) as a new standard for agent collaboration across fragmented tool ecosystems. msft.it/6014scgGq
Vlad Mihalcea (@vlad_mihalcea) 's Twitter Profile Photo

As a software engineer, it's very important to learn about Gall’s Law, which states that complex systems cannot be created successfully from scratch. In reality, even large systems, such as Netflix, Google, or Facebook, have started small and built incrementally over the

As a software engineer, it's very important to learn about Gall’s Law, which states that complex systems cannot be created successfully from scratch. 

In reality,  even large systems, such as Netflix, Google, or Facebook, have started small and built incrementally over the
Nan Jiang (@nanjiang_cs) 's Twitter Profile Photo

I was surprised by how many didnt know that (1) per token MLE is whole seq MLE, and (2) PG at token level same as PG at seq level (optimizkng one big combinatorial action). story is different if you introduce fitted critic/Q-values or intermediate resets.

Junyang Lin (@justinlin610) 's Twitter Profile Photo

Qwen3-Omni finally, damn it is more than half a year since the release of Qwen2.5-Omni! Last time we thought that we had some successful attempt on unifying audio understanding and generation, yet we were still building small 7B model and we were far lagging behind on data