weakly typed (@weakly_typed) 's Twitter Profile
weakly typed

@weakly_typed

learning {ML, PL, maths} // CS pre-grad // DMs open :)

ID: 1473283984444510212

calendar_today21-12-2021 13:27:36

323 Tweet

220 Followers

533 Following

weakly typed (@weakly_typed) 's Twitter Profile Photo

I'll be in the US in March (PhD visit days): - Boston: Feb 27 to Mar ~7 - Berkeley: Mar ~7 to Mar ~18 - Pittsburgh: Mar ~18 to Mar 24 would love to meet up if you're around :) (would also appreciate recs for ppl you think I should meet -- or takes on grad school...)

weakly typed (@weakly_typed) 's Twitter Profile Photo

mechinterp people: does anyone have a good (formal?) definition of 'feature' that doesn't assume the linear representation hypothesis? like, if I have some points in high-dim space, what makes them "the composition of several features" as opposed to "some random points"

henry (@arithmoquine) 's Twitter Profile Photo

fyi the real reason i've been ignoring you is: - i want to reply - i want to be able to give you the attention and focus you deserve - i never feel like i have enough energy to properly do that

weakly typed (@weakly_typed) 's Twitter Profile Photo

maybe the most exciting interp result I’ve seen all year (if it ends up being true for interesting reasons): a meaningful step towards uncovering the type of the residual stream

Allen Downey (@allendowney) 's Twitter Profile Photo

On Reddit's statistics forum, the most common question is "What test should I use?" My answer, from 2011, is "There is only one test" allendowney.blogspot.com/2011/05/there-…

On Reddit's statistics forum, the most common question is "What test should I use?"
My answer, from 2011, is "There is only one test"

allendowney.blogspot.com/2011/05/there-…
Transluce (@transluceai) 's Twitter Profile Photo

Announcing Transluce, a nonprofit research lab building open source, scalable technology for understanding AI systems and steering them in the public interest. Read a letter from the co-founders Jacob Steinhardt and Sarah Schwettmann: transluce.org/introducing-tr…

Jason Hausenloy (@jasonhausenloy) 's Twitter Profile Photo

The tragic suicide of Sewell Setzer III shows our generation has become unwitting test subjects in a vast, unregulated AI experiment. That's why we're launching Center for Youth and AI with our Generation AI Survey in TIME. A thread: (1/10)

Samuel Marks (@saprmarks) 's Twitter Profile Photo

This is a really creative and well-executed paper on using "black-box interpretability" methods to understand and control model cognition. Especially impressed by the many applications explored IMO this is an important direction; this paper sets the field on an excellent path!

Mikel Bober-Irizar (@mikb0b) 's Twitter Profile Photo

LLMs are dramatically worse at ARC tasks the bigger they get. However, humans have no such issues - ARC task difficulty is independent of size. Most ARC tasks contain around 512-2048 pixels, and o3 is the first model capable of operating on these text grids reliably.

LLMs are dramatically worse at ARC tasks the bigger they get. However, humans have no such issues - ARC task difficulty is independent of size.

Most ARC tasks contain around 512-2048 pixels, and o3 is the first model capable of operating on these text grids reliably.
Zanzi Tangle, now at Monoidal Cafe (@tangled_zans) 's Twitter Profile Photo

I've recently learned about Algebraic Positional Encoding from Bruno Gavranović and isnt this the coolest breakthrough in mathematical approaches to transformers in the last few years arxiv.org/abs/2312.16045

Naomi Saphra hiring a lab 🧈🪰 (@nsaphra) 's Twitter Profile Photo

Take a break from arxiv/LW/AF. Sit in the woods with a random textbook and mull new ideas away from interp community lockstep. Diverge. Don’t compete with a saturated subtopic, maybe you’ll get to take weekends off. Premature overinvestment comes from monoculture.

Kelsey Piper (@kelseytuoc) 's Twitter Profile Photo

The slowly-unfolding premise of the Good Place is that everyone is damned. They are damned because they participate in the modern world; they buy from sweatshops, they eat chocolate, they fly in airplanes while the poorest people in the world see their harvests fail thanks to