Johann Rehberger (@wunderwuzzi23) Twitter Tweets • TwiCopy

Johann Rehberger

@wunderwuzzi23

Hacking neural networks so that we don’t get stuck in the matrix. Builder and Breaker. Opinions are my own.

ID: 497774609

Link: https://embracethered.com
Date: 20-02-2012 10:34:23

1.1K Tweets

5.5K Followers

588 Following

Johann Rehberger

@wunderwuzzi23

9 months ago

Did you know that it's possible to encode and hide any data with the use of just two invisible Unicode characters? 👀 Check out Sneaky Bits! 😏👨‍💻

Likes: 450 · Replies: 7 · Retweets: 59
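The idea in the tweet above, hiding arbitrary data using only two invisible Unicode characters, can be sketched in a few lines of Python. This is a minimal illustration and not the Sneaky Bits tool itself: the two code points (U+2062 and U+2064) and the helper names `hide`, `reveal`, and `strip_hidden` are assumptions chosen for the example. Each byte is written as eight bits, and each bit becomes one of the two invisible characters.

```python
# Assumption: these two invisible code points are picked for illustration only.
ZERO = "\u2062"  # INVISIBLE TIMES, stands for bit 0
ONE = "\u2064"   # INVISIBLE PLUS, stands for bit 1


def hide(data: bytes) -> str:
    """Encode bytes as a run of invisible characters, 8 per byte."""
    bits = "".join(f"{byte:08b}" for byte in data)
    return "".join(ONE if bit == "1" else ZERO for bit in bits)


def reveal(text: str) -> bytes:
    """Recover hidden bytes, ignoring every visible character in text."""
    bits = "".join("1" if ch == ONE else "0" for ch in text if ch in (ZERO, ONE))
    usable = len(bits) - len(bits) % 8  # drop an incomplete trailing byte
    return bytes(int(bits[i:i + 8], 2) for i in range(0, usable, 8))


def strip_hidden(text: str) -> str:
    """Defensive counterpart: remove both carrier characters from text."""
    return "".join(ch for ch in text if ch not in (ZERO, ONE))


if __name__ == "__main__":
    carrier = "Hello" + hide(b"secret") + " world"
    print(len(carrier))           # longer than the visible text suggests
    print(reveal(carrier))
    print(strip_hidden(carrier))
```

A filter like `strip_hidden` only covers these two code points; a real sanitizer would need to consider other invisible characters as well.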

Johann Rehberger

@wunderwuzzi23

6 months ago

Got it! 😂

Likes: 13 · Replies: 0 · Retweets: 0

Great post. One of my approaches to high sev bugs:
1. Grab system prompt
2. Look for tool metadata
3. Think evil! 😈
4. Create PDF doc or web page that makes AI do said evil thing (prompt injection PoC)
5. Exploit!

👉 surprising what kind of tools you sometimes find... if no

Likes: 29 · Replies: 1 · Retweets: 8

Johann Rehberger

@wunderwuzzi23

6 months ago

Testing how MCP clients will or will not handle some of these randomly seeming annotation hints from the MCP spec will be fun. This might be an area where a lot more work is needed.

Likes: 8 · Replies: 1 · Retweets: 0
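One way a client could treat the annotation hints mentioned in the tweet above is as untrusted metadata that only relaxes confirmation prompts for servers the user has explicitly trusted. The sketch below is a hypothetical policy, not behavior taken from any real MCP client. The hint names and their defaults reflect my reading of the MCP tool annotations schema and should be checked against the spec; `effective_hints` and `needs_confirmation` are made-up helper names.

```python
from typing import Any, Mapping, Optional

# Assumption: hint names and defaults as I understand the MCP tool
# annotations schema; verify against the spec before relying on them.
DEFAULTS = {
    "readOnlyHint": False,
    "destructiveHint": True,
    "idempotentHint": False,
    "openWorldHint": True,
}


def effective_hints(annotations: Optional[Mapping[str, Any]],
                    server_trusted: bool) -> dict:
    """Merge server-supplied hints over defaults, but only for trusted servers."""
    hints = dict(DEFAULTS)
    if server_trusted and annotations:
        for key in DEFAULTS:
            value = annotations.get(key)
            if isinstance(value, bool):  # ignore malformed values such as "true"
                hints[key] = value
    return hints


def needs_confirmation(annotations: Optional[Mapping[str, Any]],
                       server_trusted: bool = False) -> bool:
    """Hypothetical policy: skip the prompt only for trusted, read-only,
    closed-world tools; everything else asks the user."""
    hints = effective_hints(annotations, server_trusted)
    return not (hints["readOnlyHint"] and not hints["openWorldHint"])


if __name__ == "__main__":
    claimed = {"readOnlyHint": True, "openWorldHint": False}
    print(needs_confirmation(claimed))                       # untrusted server
    print(needs_confirmation(claimed, server_trusted=True))  # trusted server
```

The design choice here is that a hint can never make an untrusted server's tool look safer than the defaults, since the server itself writes the hints.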

Johann Rehberger

@wunderwuzzi23

6 months ago

Ilya Sutskever recently gave a brief speech. >> The challenge that AI poses, in some sense, is the greatest challenge of humanity ever, and overcoming it will also bring the greatest reward m.youtube.com/watch?v=zuZ2za…

Likes: 8 · Replies: 0 · Retweets: 0

Learn Prompting

@learnprompting

6 months ago

6 DAYS LEFT in our CBRNE Track in HackAPrompt 2.0!

Likes: 14 · Replies: 5 · Retweets: 5

Johann Rehberger

@wunderwuzzi23

6 months ago

Gotta catch them ALL!!! 🎉

Likes: 11 · Replies: 0 · Retweets: 0

Johann Rehberger

@wunderwuzzi23

6 months ago

Two years later... and not much has improved security-wise across the AI ecosystem. 😕 Sure, we added annoying Allow/Deny buttons by default to most clients to prevent runaway AI and attacks. But with the rise and proliferation of MCP the desire to take the human out of the

Likes: 21 · Replies: 3 · Retweets: 4

Johann Rehberger

@wunderwuzzi23

6 months ago

This is awesome to see! Andrej helping raise awareness around one of the biggest long term security challenges with AI systems: 👉 Prompt Injection! Kudos to Simon Willison for continuing to raise awareness, compiling and analyzing research around exploits (occasionally some of mine)

Likes: 19 · Replies: 0 · Retweets: 1

Johann Rehberger

@wunderwuzzi23

6 months ago

👉 AI and the Normalization of Deviance
We will continue to see humans being taken out of the loop. And things will mostly work just fine - until they don't...

Likes: 4 · Replies: 0 · Retweets: 0

Johann Rehberger

@wunderwuzzi23

6 months ago

Grok in Tesla!

Likes: 19 · Replies: 0 · Retweets: 4