Hillary Sanders (@hillarymsanders) 's Twitter Profile
Hillary Sanders

@hillarymsanders

Machine-learner, meat-learner, research scientist, AI Safety thinker. Model trainer, skeptical adorer of statistics.

Co-author of: Malware Data Science

ID: 2990366710

linkhttp://hillarysanders.com calendar_today21-01-2015 18:16:30

547 Tweet

560 Followers

85 Following

Hillary Sanders (@hillarymsanders) 's Twitter Profile Photo

Sure, ya can't rely on it, but sometimes inner misalignment (following standard methods of paperclip production even you could make more by taking over humanity) will help deal with with outer misalignment (oops, we trained it to maximize paperclips, which is obvs terribad)

Hillary Sanders (@hillarymsanders) 's Twitter Profile Photo

When I've read a fair amount of research / technical blog posts from someone, and then I watch a video of them speaking about research, and they swear like a sailer. A very well-informed technically accurate neural network specialist sailor. I like it. 😁

Hillary Sanders (@hillarymsanders) 's Twitter Profile Photo

Tried to google "Why do I look so young on Zoom"... That didn't work 😂; I am not searching in the same direction as most people. But I guess that means there's not an automatic face filter??

Hillary Sanders (@hillarymsanders) 's Twitter Profile Photo

OK, completely unrelated to AI, but why do most baby socks come in variety packs? Like, surely most parents care more about being able to find matching socks easily VS. making sure their baby's socks match their daily outfit? I am bemused. bit.ly/3RlK5NA

Brad Chattergoon (@bradchattergoon) 's Twitter Profile Photo

Just found out about Milton Friedman’s Thermostat analogy. Very good explanation of an important concept. I think scott cunningham has a similar analogy with a sailboat staying on course in the face of wind in the Mixtape.

Just found out about Milton Friedman’s Thermostat analogy. Very good explanation of an important concept. I think <a href="/causalinf/">scott cunningham</a> has a similar analogy with a sailboat staying on course in the face of wind in the Mixtape.
AI Notkilleveryoneism Memes ⏸️ (@aisafetymemes) 's Twitter Profile Photo

Open source software is GREAT. But "open source" AI is NOT like software - it's VERY different. Rob Miles cuts through the bullshit: ROB: Oh, hey, Meta. I heard Llama's weights leaked. That's rough, man. Information security's hard. How you holding up? META: Oh, we're

Rob Miles (in SF) (@robertskmiles) 's Twitter Profile Photo

"LLMs can't do logical reasoning at all, they've just memorised some basic rules of logic, and use pattern matching to imperfectly apply those rules to new situations"

Ted Sanders (@sandersted) 's Twitter Profile Photo

AGI is hard to define. my preferred definition of AGI is a computer system that can can accomplish a task impossible for 100 human geniuses working together, such as publishing a blog post with a single canonical spelling of GPT-4o / gpt-4o / gpt4o

Hillary Sanders (@hillarymsanders) 's Twitter Profile Photo

In AI safety, we have inner misalignment (actions don't minimize the loss func) and outer misalignment (misspecified loss func) I do think that inner misalignment (~~learned features) tend to act as a protective mechanism to avoid outer misalignment implications Er, I hope.

Miles Brundage (@miles_brundage) 's Twitter Profile Photo

Trying to imagine aspirin company CEOs signing an open letter saying "we're worried that aspirin might cause an infection that kills everyone on earth - not sure of the solution" and journalists being like "they're just trying to sell more aspirin."

Hillary Sanders (@hillarymsanders) 's Twitter Profile Photo

A super interesting essay my partner wrote on Martin Luther King Jr - what he actually believed and accomplished (different than what is sometimes described). docs.google.com/document/d/1Ol… Incredibly impressive guy.

Hillary Sanders (@hillarymsanders) 's Twitter Profile Photo

Recently gave a live demo at PyCon US 2025 on building scalable AI tool servers using Model Context Protocol (MCP) & Heroku Recording is here: youtube.com/watch?v=01I4yw…

Hillary Sanders (@hillarymsanders) 's Twitter Profile Photo

I went on the Code[ish] podcast to talk about AI, LLMs, and building Heroku's Managed Inference & Agents platform: 🎧heroku.com/podcasts/codei…