Boaz Wasserman (@boazwasserman)'s Twitter Profile
Boaz Wasserman

@boazwasserman

Security everything.

ID: 1064052789465280512

18-11-2018 07:08:48

308 Tweets

144 Followers

101 Following

The fact that ChatGPT Code Interpreter can still be jailbroken to do really nasty stuff shows how far we are from solving LLM jailbreaks.

I was easily able to get it to create a macro-enabled document that downloads and executes a payload from pastebin 🫤

I've been seeing some posts about image based prompt injection in Google Bard.

AFAIK Bard is not really multimodal yet. It runs the image through Google Lens which does OCR + caption generation and that is fed back to Bard's prompt. So it's more of an indirect prompt injection

OpenAI came out with another piece about AI governance.

It contains a list of voluntary commitments by them to make AI safer. It's mostly fluff that reads like a T&C, but interesting to see their take about security.

Interesting to see that their biggest worry seems to be the
Harrison Chase (@hwchase17)

🧪 langchain_experimental

In an effort to make langchain leaner, more focused, and safer, we are moving select chains to a separate package on 7/28

Big thanks to folks like Boaz Wasserman, Or Raz, Justin Flick for pushing on the safety part

There will be some breaking changes 🧵