Evan Anders (@evanhanders) 's Twitter Profile
Evan Anders

@evanhanders

AI Safety / Mech Interp postdoctoral scholar @KITPUCSB. Former astrophysical fluid dynamicist @Northwestern (CIERA) and @CUBoulder.

ID: 4096173558

linkhttps://evanhanders.bitbucket.io/ calendar_today02-11-2015 00:33:39

29 Tweet

74 Takipçi

154 Takip Edilen

Anthropic (@anthropicai) 's Twitter Profile Photo

New Anthropic research paper: Scaling Monosemanticity. The first ever detailed look inside a leading large language model. Read the blog post here: anthropic.com/research/mappi…

New Anthropic research paper: Scaling Monosemanticity.

The first ever detailed look inside a leading large language model.

Read the blog post here: anthropic.com/research/mappi…
OpenAI (@openai) 's Twitter Profile Photo

We're sharing progress toward understanding the neural activity of language models. We improved methods for training sparse autoencoders at scale, disentangling GPT-4’s internal representations into 16 million features—which often appear to correspond to understandable concepts.