Charles Sutton (@randomlywalking) 's Twitter Profile
Charles Sutton

@randomlywalking

Research scientist @GoogleAI / Previously academic @InfAtEd / Deep learning to help people write code. / @[email protected] / ❤️s:🐱🐶☕️🍕

ID: 21815759

linkhttp://homepages.inf.ed.ac.uk/csutton/ calendar_today25-02-2009 00:12:29

4,4K Tweet

17,17K Takipçi

1,1K Takip Edilen

Dan Zheng (@dancherp) 's Twitter Profile Photo

Project Big Sleep: exciting work from the team on using LLMs to catch vulnerabilities in real-world code! 💤 googleprojectzero.blogspot.com/2024/10/from-n…

Project Big Sleep: exciting work from the team on using LLMs to catch vulnerabilities in real-world code! 💤

googleprojectzero.blogspot.com/2024/10/from-n…
Ivan Fratric 💙💛 (@ifsecure) 's Twitter Profile Photo

Don't panic now, but LLM-based agent discovered a previously unknown real-world vulnerability. Details in googleprojectzero.blogspot.com/2024/10/from-n…

Royal Hansen (@royalhansen) 's Twitter Profile Photo

"We believe this is the first public example of an AI agent finding a previously unknown exploitable memory-safety issue in widely used real-world software." googleprojectzero.blogspot.com/2024/10/from-n…

Vaibhav Tulsyan (@xennygrimmato_) 's Twitter Profile Photo

Today I'm excited to share with you all that our security AI Agent called "Big Sleep" has found its first vulnerability in an open-source codebase! This has been a great collaborative effort between Google DeepMind and Google Project Zero. googleprojectzero.blogspot.com/2024/10/from-n…

Today I'm excited to share with you all that our security AI Agent called "Big Sleep" has found its first vulnerability in an open-source codebase! This has been a great collaborative effort between
<a href="/GoogleDeepMind/">Google DeepMind</a> and Google Project Zero.

googleprojectzero.blogspot.com/2024/10/from-n…
Phil Venables (@philvenables) 's Twitter Profile Photo

Google’s Big Sleep, evolved from Project Naptime, discovered an exploitable stack buffer underflow in SQLite. We believe this is the first time an AI agent has found a previously unknown exploitable memory-safety issue in real world software: googleprojectzero.blogspot.com/2024/10/from-n…

Charley Snyder (@charley_snyder_) 's Twitter Profile Photo

Exciting work from Google DeepMind and Project Zero: perhaps the first public vulnerability discovered using an LLM agent in a widely-used software component. Early days, but shows the promise of LLMs to find and fix vulnerabilities before they ever make their way into production

Google Public Policy (@googlepubpolicy) 's Twitter Profile Photo

Today Google Project Zero and Google DeepMind shared how they’ve used an LLM to catch vulnerabilities in real world code with their project “Big Sleep.” This is a huge step for using AI to assist today’s cyber defenders. googleprojectzero.blogspot.com/2024/10/from-n…

Jeff Dean (@jeffdean) 's Twitter Profile Photo

This is pretty great. An AI agent based on Gemini 1.5 Pro that is able to use various tools like a debugger, etc. was able to discover a real-world vulnerability in a real and important software package (SQLite), and importantly, before it appeared in an official release.🔒

Varun Godbole (@varungodbole) 's Twitter Profile Photo

Excited to share our prompt tuning playbook! (Not an official product. Just authors tips & tricks for better prompting). I'm most excited about first half on mental models for post-training & prompting. Feedback/forks welcome! #LLM #PromptEngineering github.com/varungodbole/p…

Excited to share our prompt tuning playbook! (Not an official product. Just authors tips &amp; tricks for better prompting).  I'm most excited about first half on mental models for post-training &amp; prompting. Feedback/forks welcome!  #LLM #PromptEngineering

github.com/varungodbole/p…
Maziyar PANAHI (@maziyarpanahi) 's Twitter Profile Photo

Google just released Gemma Embeddings! "GemmaEmbed is a dense-vector embedding model, trained especially for retrieval. As of December 12, 2024, GemmaEmbed achieves the #1 position overall on the MTEB leaderboard, with a score of 72.72."

Google just released Gemma Embeddings!

"GemmaEmbed is a dense-vector embedding model, trained especially for retrieval. As of December 12, 2024, GemmaEmbed achieves the #1 position overall on the MTEB leaderboard, with a score of 72.72."
Isha Puri (@ishapuri101) 's Twitter Profile Photo

[1/x] can we scale small, open LMs to o1 level? Using classical probabilistic inference methods, YES! Joint MIT CSAIL / Red Hat AI Innovation Team work introduces a particle filtering approach to scaling inference w/o any training! check out …abilistic-inference-scaling.github.io

[1/x] can we scale small, open LMs to o1 level? Using classical probabilistic inference methods, YES! Joint <a href="/MIT_CSAIL/">MIT CSAIL</a> / <a href="/RedHat/">Red Hat</a> AI Innovation Team work introduces a particle filtering approach to scaling inference w/o any training! check out …abilistic-inference-scaling.github.io
Dawn Song (@dawnsongtweets) 's Twitter Profile Photo

📣 Join us for the 5th Advanced LLM Agents MOOC lecture on Coding Agents and AI for Vulnerability Detection, Charles Sutton Google DeepMind, 4:10 pm PT today March 3. 🚀 Join the thriving community of the LLM Agents MOOC series, with 21K+ registered learners & ~9K members on

📣 Join us for the 5th Advanced LLM Agents MOOC lecture on Coding Agents and AI for Vulnerability Detection, <a href="/RandomlyWalking/">Charles Sutton</a> <a href="/GoogleDeepMind/">Google DeepMind</a>, 4:10 pm PT today March 3.
🚀 Join the thriving community of the LLM Agents MOOC series, with 21K+ registered learners &amp; ~9K members on
Dawn Song (@dawnsongtweets) 's Twitter Profile Photo

🚀 Really excited to launch #AgentX competition hosted by UC Berkeley RDI UC Berkeley alongside our LLM Agents MOOC series (a global community of 22k+ learners & growing fast). Whether you're building the next disruptive AI startup or pushing the research frontier, AgentX is your

🚀 Really excited to launch #AgentX competition hosted by <a href="/BerkeleyRDI/">UC Berkeley RDI</a> <a href="/UCBerkeley/">UC Berkeley</a> alongside our LLM Agents MOOC series (a global community of 22k+ learners &amp; growing fast). Whether you're building the next disruptive AI startup or pushing the research frontier, AgentX is your