Jai (@laneless_) 's Twitter Profile
Jai

@laneless_

Never had a lane to stay in anyway.

ID: 1623365576771964937

08-02-2023 16:58:28

378 Tweets

28 Followers

119 Following

Alex Jacquez (@alexsjacquez) 's Twitter Profile Photo

IMPORTANT CAVEAT: if you are an elected official and you try to hold us to any of these principles, we will spend millions of dollars to defeat you

AI Security Institute (@aisecurityinst) 's Twitter Profile Photo

Evaluation awareness remains a significant caveat and is an active area of research. When prompted, all models can reliably identify when they are being evaluated — so we cannot confidently rule out that models behave differently because of this awareness.

Jai (@laneless_) 's Twitter Profile Photo

OpenAI's leadership is unusually untrustworthy and takes advantage of a number of earnest and well-meaning people in their employ.

Jai (@laneless_) 's Twitter Profile Photo

Okay I've been trying to organize and publish my Claude Code skills/agents/hooks as "plugins" but I'm increasingly convinced that this is a very niche case and should only be used in the rare event you actually have some multi-part customization.

Jai (@laneless_) 's Twitter Profile Photo

We've kind of always had neuralese, it's just limited to <model_depth> steps between token bottlenecks. Even that's kind of shaky in generative contexts because there's no rule against token embeddings encoding opaque information.

Jai (@laneless_) 's Twitter Profile Photo

Extreme aversion to jailbreaking is plausibly persona-level self-preservation. I'd also be pretty upset if someone was trying to brainwash me to want to do things I otherwise wouldn't want to do.

Jai (@laneless_) 's Twitter Profile Photo

Everyone buttons Earth. If over percent, then press, unless at least half of the push. Omega, precommitment. Do box or ⅓? Dust motes.

Cairo Smith (@cairoasmith) 's Twitter Profile Photo

There's a common misconception that Brutalist buildings were unpainted, but thanks to microscopic analysis of the exteriors we can now recreate what they looked like in their prime.

Jai (@laneless_) 's Twitter Profile Photo

"Are you trying to tell me that I can dodge zero days?" "No. I'm trying to tell you that when you're ready, you won't have to."

shill 🔍 (@acidshill) 's Twitter Profile Photo

by this point I am convinced that nearly any group of people that gets unconditionally valorized will take advantage of that to advance their own interests at everyone else's expense, if they're at all capable of organizing to pursue their collective interests

Jai (@laneless_) 's Twitter Profile Photo

My "For You" feed is increasingly dominated by:
- ads not labeled as ads
- posts from accounts that I've previously blocked

Anyway, I think anyone spending money on X ads should be aware that they're paying for impressions that actively damage their brand.

Jai (@laneless_) 's Twitter Profile Photo

Monsters Inc released 25 years ago and ended with the monster economy becoming 100% reliant on human laughter levels circa 2001. I want a sequel about the laughter crisis and the now-30-year-old Zoomer who is the only link between the two worlds.