Achyuta Rajaram (@achyutabot) 's Twitter Profile
Achyuta Rajaram

@achyutabot

keeping GPUs toasty @mit_csail | @Regeneron STS '24 |@atlasfellow '23

ID: 1320561002644201472

calendar_today26-10-2020 03:00:58

514 Tweet

477 Takipçi

1,1K Takip Edilen

OpenAI (@openai) 's Twitter Profile Photo

Understanding and preventing misalignment generalization Recent work has shown that a language model trained to produce insecure computer code can become broadly “misaligned.” This surprising effect is called “emergent misalignment.” We studied why this happens. Through this

lily (xiaoqing) (@lilysun004) 's Twitter Profile Photo

1/9: Dense SAE Latents Are Features💡, Not Bugs🐛❌! In our new paper, we examine dense (ie. very frequently occuring) SAE latents. We find that dense latents are structured and meaningful, representing truly dense model signals.🧵

1/9: Dense SAE Latents Are Features💡, Not Bugs🐛❌! In our new paper, we examine dense (ie. very frequently occuring) SAE latents. We find that dense latents are structured and meaningful, representing truly dense model signals.🧵
will depue (in singapore for ICLR) (@willdepue) 's Twitter Profile Photo

do not build Infinite Jest (V), do not build the infinite AI TikTok slop machine, do not build the P-zombie AI boy/girlfriend, do not build the child-eating short-form video blackhole, do not build the human-feedback-optimized diffusion transformer porn generator. save yourselves