Julia Neagu (@juliaaneagu)'s Twitter Profile
Julia Neagu

@juliaaneagu

CEO & Co-Founder @QuotientAI ✨ formerly @GitHub @GitHubCopilot 🤖 reformed physicist 👩‍🔬 ~ opinions are my own ~

ID: 1246476937

Link: https://www.quotientai.co
Joined: 06-03-2013 16:28:09

1.1K Tweets

1.1K Followers

1.1K Following

Julia Neagu (@juliaaneagu)

.Freddie Vargus said he can RL an X bot that only posts topical bangers but it only works on accounts that post occasional bangers. who wants to be in the control group? 🙋

Fede (@vimiumenjoyer)

Uzi is basically your command center for parallel AI coding agents, leveraging Git worktrees to keep them all working without tripping over each other

Some are finding success running agents in parallel, but there’s no easy way to do it, let alone manage it at enterprise scale

Rohan Paul (@rohanpaul_ai)

Today's edition (8-Jun) of my newsletter is ready.

(Consider subscribing, I write it daily. Link in comments & bio and also you will get a 1300+ page Python book as soon as you subscribe).

Prompting with AI scales, Verifying doesn't

Julia Neagu (@juliaaneagu)

Let's talk about this for a second: “𝘐 𝘳𝘶𝘯 𝘦𝘷𝘢𝘭𝘴 𝘣𝘦𝘧𝘰𝘳𝘦 𝘐 𝘴𝘩𝘪𝘱 𝘴𝘰 𝘐 𝘬𝘯𝘰𝘸 𝘱𝘦𝘳𝘧𝘰𝘳𝘮𝘢𝘯𝘤𝘦 𝘸𝘰𝘯’𝘵 𝘥𝘦𝘨𝘳𝘢𝘥𝘦” This is great but it's like saying “I wrote unit tests so I don’t need monitoring in prod”. Here’s my take on this whole thing:

Julia Neagu (@juliaaneagu)

“You want your model hitting milestones, not minefields.” Most AI eval talk is hand-wavy. This isn’t. Freddie Vargus (Quotient AI CTO) gets into the weeds: how to actually test tool use, avoid minefields, and build agents that don’t break. Check out the recording👇