Adam Pearce (@adamrpearce)'s Twitter Profile
Adam Pearce

@adamrpearce

@anthropicai, previously: google brain, @nytgraphics and @bbgvisualdata

ID: 555102816

Link: http://roadtolarissa.com
Joined: 16-04-2012 10:26:54

991 Tweets

5.5K Followers

372 Following

Nithum (@nithum)

Check out our new explorable on machine learning calibration: Machine learning models express their uncertainty as model scores, but through calibration we can transform these scores into probabilities for more effective decision making. pair.withgoogle.com/explorables/un…

Adam Pearce (@adamrpearce)

Most machine learning models are trained by collecting vast amounts of data on a central server. Nicole Mitchell and I looked at how federated learning makes it possible to train models without any user's raw data leaving their device. pair.withgoogle.com/explorables/fe…

Asma Ghandeharioun (@ghandeharioun)

🧵Can we “ask” an LLM to “translate” its own hidden representations into natural language? We propose 🩺Patchscopes, a new framework for decoding specific information from a representation by “patching” it into a separate inference pass, independently of its original context. 1/9

Michael Hanna (@michaelwhanna)

Mateusz and I are excited to announce circuit-tracer, a library that makes circuit-finding simple! Just type in a sentence, and get out a circuit showing (some of) the features your model uses to predict the next token. Try it on neuronpedia: shorturl.at/SUX2A

Goodfire (@goodfireai)

New research update! We replicated Anthropic's circuit tracing methods to test if they can recover a known, simple transformer mechanism.
