Johann Rehberger (@wunderwuzzi23) 's Twitter Profile
Johann Rehberger

@wunderwuzzi23

Hacking neural networks so that we don’t get stuck in the matrix. Builder and Breaker. Opinions are my own.

ID: 497774609

linkhttps://embracethered.com calendar_today20-02-2012 10:34:23

1,1K Tweet

5,5K Takipçi

588 Takip Edilen

Johann Rehberger (@wunderwuzzi23) 's Twitter Profile Photo

Did you know that it's possible to encode and hide any data with the use of just two invisible Unicode characters? 👀 Check out Sneaky Bits! 😏👨‍💻

Did you know that it's possible to encode and hide any data with the use of just two invisible Unicode characters? 👀 

Check out Sneaky Bits! 😏👨‍💻
Johann Rehberger (@wunderwuzzi23) 's Twitter Profile Photo

Great post. One of my approaches to high sev bugs: 1. Grab system prompt 2. Look for tool metadata 3. Think evil! 😈 4. Create pdf doc or web page that makes AI do said evil thing (prompt injection PoC) 5. Exploit! 👉 surprising what kind tools you sometimes find... if no

Johann Rehberger (@wunderwuzzi23) 's Twitter Profile Photo

Testing how MCP clients will or will not handle some of these randomly seeming annotation hints from the MCP spec will be fun. This might be an area where a lot more work is needed.

Testing how MCP clients will or will not handle some of these randomly seeming annotation hints from the MCP spec will be fun.

This might be an area where a lot more work is needed.
Johann Rehberger (@wunderwuzzi23) 's Twitter Profile Photo

Ilya Sutskever recently gave a brief speech. >> The challenge that AI poses, in some sense, is the greatest challenge of humanity ever, and overcoming it will also bring the greatest reward m.youtube.com/watch?v=zuZ2za…

Johann Rehberger (@wunderwuzzi23) 's Twitter Profile Photo

Two years later... and not much has improved security wise across the AI ecosystem. 😕 Sure, we added annoying Allow/Deny buttons by default to most clients to prevent runaway AI and attacks. But with the rise and proliferation of MCP the desire to take the human out of the

Johann Rehberger (@wunderwuzzi23) 's Twitter Profile Photo

This is awesome to see! Andrej helping raise awareness around one of the biggest long term security challenges with AI systems: 👉 Prompt Injection! Kudos to Simon Willison for continuing to raise awareness, compiling and analyzing research around exploits (occasionally some of mine)

Johann Rehberger (@wunderwuzzi23) 's Twitter Profile Photo

👉 AI and the Normalization of Deviance We will continue to see humans being taken out of the loop. And things will mostly work just fine - until they don't....