
seanpixel 🫧
@sean_pixel
jack of some trades
ID: 1108856460304216064
http://seanpixel.com 21-03-2019 22:22:36
1,1K Tweet
933 Takipçi
606 Takip Edilen

you can now improve RL models WITHOUT ANY TRAINING. Inspired by mechanistic interpretability for LLMs (cred: Jacob Dunefsky Emmanuel Ameisen Neel Nanda), I applied sparse-transcoder methods to a CartPole policy and saw a +24% performance increase with zero additional training. (1/9)
