Daking Rai (@dakingrai) 's Twitter Profile
Daking Rai

@dakingrai

CS PhD Student @GeorgeMasonU

ID: 2828986548

linkhttps://dakingrai.github.io/ calendar_today24-09-2014 00:53:57

29 Tweet

179 Followers

305 Following

Daking Rai (@dakingrai) 's Twitter Profile Photo

[1/6] Mechanistic Interpretability (MI) is an emerging sub-field of interpretability that aims to understand LMs by reverse-engineering its underlying computation. Here we present a comprehensive survey curated specifically as a ๐ ๐ฎ๐ข๐๐ž ๐Ÿ๐จ๐ซ ๐ง๐ž๐ฐ๐œ๐จ๐ฆ๐ž๐ซ๐ฌ ๐ญ๐จ ๐ญ๐ก๐ข๐ฌ

[1/6] Mechanistic Interpretability (MI) is an emerging sub-field of interpretability that aims to understand LMs by reverse-engineering its underlying computation. Here we present a comprehensive survey curated specifically as a ๐ ๐ฎ๐ข๐๐ž ๐Ÿ๐จ๐ซ ๐ง๐ž๐ฐ๐œ๐จ๐ฆ๐ž๐ซ๐ฌ ๐ญ๐จ ๐ญ๐ก๐ข๐ฌ