Charles Sutton (@randomlywalking) Twitter Tweets • TwiCopy

Charles Sutton

@randomlywalking

+ Follow

Research scientist @GoogleAI / Previously academic @InfAtEd / Deep learning to help people write code. / @[email protected] / ❤️s:🐱🐶☕️🍕

ID: 21815759

linkhttp://homepages.inf.ed.ac.uk/csutton/ calendar_today25-02-2009 00:12:29

4,4K Tweet

17,17K Followers

1,1K Following

Dan Zheng

@dancherp

a year ago

Project Big Sleep: exciting work from the team on using LLMs to catch vulnerabilities in real-world code! 💤 googleprojectzero.blogspot.com/2024/10/from-n…

thumb_up_off_alt16

chat_bubble_outline0

repeat2

shareShare

Ivan Fratric 💙💛

@ifsecure

a year ago

Don't panic now, but LLM-based agent discovered a previously unknown real-world vulnerability. Details in googleprojectzero.blogspot.com/2024/10/from-n…

thumb_up_off_alt198

chat_bubble_outline2

repeat59

shareShare

"We believe this is the first public example of an AI agent finding a previously unknown exploitable memory-safety issue in widely used real-world software." googleprojectzero.blogspot.com/2024/10/from-n…

thumb_up_off_alt18

chat_bubble_outline0

repeat6

shareShare

Vaibhav Tulsyan

@xennygrimmato_

a year ago

Today I'm excited to share with you all that our security AI Agent called "Big Sleep" has found its first vulnerability in an open-source codebase! This has been a great collaborative effort between Google DeepMind and Google Project Zero. googleprojectzero.blogspot.com/2024/10/from-n…

thumb_up_off_alt216

chat_bubble_outline4

repeat39

shareShare

Phil Venables

@philvenables

a year ago

Google’s Big Sleep, evolved from Project Naptime, discovered an exploitable stack buffer underflow in SQLite. We believe this is the first time an AI agent has found a previously unknown exploitable memory-safety issue in real world software: googleprojectzero.blogspot.com/2024/10/from-n…

thumb_up_off_alt18

chat_bubble_outline0

repeat12

shareShare

Charley Snyder

@charley_snyder_

a year ago

Exciting work from Google DeepMind and Project Zero: perhaps the first public vulnerability discovered using an LLM agent in a widely-used software component. Early days, but shows the promise of LLMs to find and fix vulnerabilities before they ever make their way into production

thumb_up_off_alt20

chat_bubble_outline0

repeat6

shareShare

Google Public Policy

@googlepubpolicy

a year ago

Today Google Project Zero and Google DeepMind shared how they’ve used an LLM to catch vulnerabilities in real world code with their project “Big Sleep.” This is a huge step for using AI to assist today’s cyber defenders. googleprojectzero.blogspot.com/2024/10/from-n…

thumb_up_off_alt85

chat_bubble_outline3

repeat21

shareShare

Jeff Dean

@jeffdean

a year ago

This is pretty great. An AI agent based on Gemini 1.5 Pro that is able to use various tools like a debugger, etc. was able to discover a real-world vulnerability in a real and important software package (SQLite), and importantly, before it appeared in an official release.🔒

thumb_up_off_alt573

chat_bubble_outline15

repeat84

shareShare

Varun Godbole

@varungodbole

a year ago

Excited to share our prompt tuning playbook! (Not an official product. Just authors tips & tricks for better prompting). I'm most excited about first half on mental models for post-training & prompting. Feedback/forks welcome! #LLM #PromptEngineering github.com/varungodbole/p…

thumb_up_off_alt614

chat_bubble_outline13

repeat132

shareShare

Maziyar PANAHI

@maziyarpanahi

a year ago

Google just released Gemma Embeddings! "GemmaEmbed is a dense-vector embedding model, trained especially for retrieval. As of December 12, 2024, GemmaEmbed achieves the #1 position overall on the MTEB leaderboard, with a score of 72.72."

thumb_up_off_alt1,1K

chat_bubble_outline23

repeat137

shareShare

Charles Sutton

@randomlywalking

10 months ago

This is a masterpiece! Highly recommended.

thumb_up_off_alt10

chat_bubble_outline0

repeat0

shareShare

Isha Puri

@ishapuri101

10 months ago

[1/x] can we scale small, open LMs to o1 level? Using classical probabilistic inference methods, YES! Joint MIT CSAIL / Red Hat AI Innovation Team work introduces a particle filtering approach to scaling inference w/o any training! check out …abilistic-inference-scaling.github.io

[1/x] can we scale small, open LMs to o1 level? Using classical probabilistic inference methods, YES! Joint <a href="/MIT_CSAIL/">MIT CSAIL</a> / <a href="/RedHat/">Red Hat</a> AI Innovation Team work introduces a particle filtering approach to scaling inference w/o any training! check out …abilistic-inference-scaling.github.io

thumb_up_off_alt229

chat_bubble_outline2

repeat68

shareShare

Dawn Song

@dawnsongtweets

9 months ago

📣 Join us for the 5th Advanced LLM Agents MOOC lecture on Coding Agents and AI for Vulnerability Detection, Charles Sutton Google DeepMind, 4:10 pm PT today March 3. 🚀 Join the thriving community of the LLM Agents MOOC series, with 21K+ registered learners & ~9K members on

📣 Join us for the 5th Advanced LLM Agents MOOC lecture on Coding Agents and AI for Vulnerability Detection, <a href="/RandomlyWalking/">Charles Sutton</a> <a href="/GoogleDeepMind/">Google DeepMind</a>, 4:10 pm PT today March 3.
🚀 Join the thriving community of the LLM Agents MOOC series, with 21K+ registered learners & ~9K members on

thumb_up_off_alt121

chat_bubble_outline9

repeat18

shareShare

Dawn Song

@dawnsongtweets

9 months ago

🚀 Really excited to launch #AgentX competition hosted by UC Berkeley RDI UC Berkeley alongside our LLM Agents MOOC series (a global community of 22k+ learners & growing fast). Whether you're building the next disruptive AI startup or pushing the research frontier, AgentX is your

🚀 Really excited to launch #AgentX competition hosted by <a href="/BerkeleyRDI/">UC Berkeley RDI</a> <a href="/UCBerkeley/">UC Berkeley</a> alongside our LLM Agents MOOC series (a global community of 22k+ learners & growing fast). Whether you're building the next disruptive AI startup or pushing the research frontier, AgentX is your

thumb_up_off_alt410

chat_bubble_outline20

repeat108

shareShare

Charles Sutton

Dan Zheng

Ivan Fratric 💙💛

Royal Hansen

Vaibhav Tulsyan

Phil Venables

Charley Snyder

Google Public Policy

Jeff Dean

Varun Godbole

Maziyar PANAHI

Charles Sutton

Isha Puri

Dawn Song

Dawn Song