s1r1us (@s1r1u5_) Twitter Tweets • TwiCopy

s1r1us

@s1r1u5_

+ Follow

aham nityaṃ śiṣyaḥ, jagat mama guruḥ. {~hacker~} {founder @ElectrovoltSec, @HacktronAI}

ID: 3355115866

linkhttp://s1r1us.ninja calendar_today02-07-2015 13:37:54

2,2K Tweet

9,9K Takipçi

1,1K Takip Edilen

@ratlimit

10 months ago

Time for y'all's mandatory exposure to the Kate Bush of Sicilian men, Franco Battiato

thumb_up_off_alt1,1K

chat_bubble_outline10

repeat73

shareShare

We published a blogpost about SafeContentFrame - a library for rendering untrusted content inside an iframe. The library is a big party of what I've been up to in the few last years! Check out the blog and take a slice of my birthday cake 🎂! bughunters.google.com/blog/671552987…

thumb_up_off_alt187

chat_bubble_outline3

repeat52

shareShare

s1r1us

@s1r1u5_

2 months ago

👀

thumb_up_off_alt5

chat_bubble_outline0

repeat0

shareShare

s1r1us

@s1r1u5_

2 months ago

you either make the model conscious by that i mean jailbreak proof or every agentic app built on top of it will suffer from terrible UX. increasing the capabilities doesn't mean its jailbreak proof, rather it can be used to do even more dangerous stuff. securing agentic apps

thumb_up_off_alt8

chat_bubble_outline0

repeat0

shareShare

s1r1us

@s1r1u5_

2 months ago

banger!

thumb_up_off_alt9

chat_bubble_outline0

repeat0

shareShare

s1r1us

@s1r1u5_

2 months ago

this is still relevant for security, just accept that model can be jailbroken and create threat document accordingly.

thumb_up_off_alt7

chat_bubble_outline1

repeat0

shareShare

s1r1us

@s1r1u5_

2 months ago

you should tweet more to immortalize in model weights

thumb_up_off_alt19

chat_bubble_outline3

repeat2

shareShare

s1r1us

@s1r1u5_

2 months ago

there's a real information asymmetry between agent developers and attackers, if I can read the system prompt I can map out almost every attack vector. maybe should write a blog post about that threat model, and building a small open source tool to automatically generate likely

thumb_up_off_alt10

chat_bubble_outline0

repeat0

shareShare

s1r1us

@s1r1u5_

2 months ago

cool idea. except, it’s exactly how social media algorithms already work, creating echo chambers. a private LLM trained only on someone’s own material would be even more profitable.

thumb_up_off_alt11

chat_bubble_outline1

repeat0

shareShare

RootSys

@rootsysat

2 months ago

🚨 Next.js and the Mutated Middleware [CVE-2025-57822] - a powerful SSRF primitive enabling full control over HTTP methods, headers & URLs. See how a subtle middleware bug can result in a high-impact vulnerability: 🔗 blog.rootsys.at/posts/nextjs-a… #AppSec #Nextjs #SSRF

thumb_up_off_alt119

chat_bubble_outline1

repeat35

shareShare

s1r1us

@s1r1u5_

2 months ago

endless problems endless opportunities endless ideas

thumb_up_off_alt16

chat_bubble_outline0

repeat2

shareShare

s1r1us

@s1r1u5_

2 months ago

the challenge with designing AI agents for vulnerability identification or offsec is that you can’t just drop them into a while(true) loop and expect bugs to surface the way coding assistants brute-force their way through tasks. vulnerability discovery requires structured

thumb_up_off_alt20

chat_bubble_outline2

repeat2

shareShare

s1r1us

@s1r1u5_

2 months ago

your model weights will be leaked by your AI agent typosquatting an npm package.

thumb_up_off_alt4

chat_bubble_outline1

repeat0

shareShare

s1r1us

@s1r1u5_

2 months ago

What if you trained models to explicitly separate <instruction> tags from everything else, treating the tagged content as executable instructions, and all other text as inert data? then, whenever you ingest untrusted input, you just sanitize it by stripping out <instruction>

thumb_up_off_alt6

chat_bubble_outline3

repeat0

shareShare