Danny Tarlow (@dtarlow2)'s Twitter Profile
Danny Tarlow

@dtarlow2

Research Scientist @ Google DeepMind. Tech lead for Gemini's coding abilities.

Twitter account #2, now stronger security.

ID: 1385945814137245700

24-04-2021 13:17:45

40 Tweets

917 Followers

330 Following

Google AI (@googleai)

Code-change reviews are a critical and time-consuming part of software development. Learn how we applied recent advances in large sequence models in a real-world setting to automatically resolve code review comments in the day-to-day development workflow →goo.gle/3MRg8lY

👩‍💻 Paige Bailey (@dynamicwebpaige)

✨👩‍💻Honored to have been part of this project, and stay tuned for exciting new upgrades soon! Jacob Austin Chris Gorgolewski Valeriya Kharatyan Danny Tarlow Alex Frömmgen, Peter Choy, Gabby Surita, Kevin Villela, and more (and extended contributions, from the Visual Studio Code + Cider-V teams). 🚀

Danny Tarlow (@dtarlow2)

Very happy to share our work on activating Google's software dev process as an engine for ML-powered dev tools. A multi-year effort from many across Alphabet. Special shout-out to Jacob Austin Pascal Lamblin 4️⃣💉😷 Pierre A Manzagol Dan Zheng & Petros Maniatis. See Jacob's🧵& the blog for more.

Jeff Dean (@jeffdean)

Super exciting work by many at Google to push forward ML models that help with many aspects of the software development process. Not just simple code completion, but fixing compilation errors, suggesting test cases, helping w/ code review comments, and all kinds of other things.

👩‍💻 Paige Bailey (@dynamicwebpaige)

✨👩‍💻 Greetings, fellow shape rotators, theorem provers, and code enthusiasts! We're organizing a Code + Math AI meetup at #ICML2023—6:00pm on Thursday, July 27th. There will be appetizers, great company, and also SET card games (if I can scrounge up a few). 😁 📧 DM for

Disha Shrivastava (@dishashrivasta9)

Thrilled to share that I successfully defended my PhD thesis today (earning an excellent grade)! Huge thanks to my committee members Aishwarya Agrawal, Baishakhi Ray & Laurent Charlin. Immensely grateful to Hugo Larochelle & Danny Tarlow for being the best supervisors one could ask for! #PhDone

Daniel Johnson (@_ddjohnson)

New paper: How can you tell when a model is hallucinating? Let it cheat! An expert doesn't need to cheat, so if your model learns to cheat, there must be something it doesn't know. Our general new approach for measuring uncertainty: arxiv.org/abs/2402.08733

Danny Tarlow (@dtarlow2)

I'm constantly impressed by Daniel's ability to generate (and follow through on) ideas that are creative, intuitive, and rigorously grounded. Very excited by this one!

Daniel Johnson (@_ddjohnson)

Excited to share Penzai, a JAX research toolkit from Google DeepMind for building, editing, and visualizing neural networks! Penzai makes it easy to see model internals and lets you inject custom logic anywhere. Check it out on GitHub: github.com/google-deepmin…

Danny Tarlow (@dtarlow2)

This blew me away the first time that Daniel showed it to me, and I'm super happy to see it released for everybody. This kind of hands-on interaction with a model that allows inspection and intervention is so powerful for developing an understanding of complicated models.

Vaibhav Tulsyan (@xennygrimmato_)

Excited to share a new blog on ML-based repair for build errors at Google! We found that automatically repairing build errors in the IDE increases productivity as measured by overall task completion with no detectable negative impact on code safety!

Cursor (@cursor_ai)

Gemini 2.5 Pro is available to all Cursor users! You can enable the full 1M context window if you'd like. We're curious to hear how you think it compares to Sonnet.

Silas Alberti (@silasalberti)

Wow we just ran Gemini 2.5 Pro on our evals and it got a new state of the art. Congrats to the Gemini team! Sharing preliminary results here and working on bringing it into Devin:

Sundar Pichai (@sundarpichai)

Our latest Gemini 2.5 Pro update is now in preview. It’s better at coding, reasoning, science + math, shows improved performance across key benchmarks (AIDER Polyglot, GPQA, HLE to name a few), and leads lmarena.ai with a 24pt Elo score jump since the previous version. We also

Danny Tarlow (@dtarlow2)

Had a fun conversation with Connie and Logan that gives a peek at what we've been doing to make Gemini better at coding over the last year or so and how we're thinking about what's ahead