Achyuta Rajaram (@achyutabot) Twitter Tweets • TwiCopy

Achyuta Rajaram

@achyutabot

keeping GPUs toasty @mit_csail | @Regeneron STS '24 |@atlasfellow '23

ID: 1320561002644201472

26-10-2020 03:00:58

514 Tweets

477 Followers

1.1K Following

Achyuta Rajaram

@achyutabot

3 months ago

THANKS GREG HI KSALFJASKLFJALKS

38 likes, 0 replies, 0 retweets

Understanding and preventing misalignment generalization Recent work has shown that a language model trained to produce insecure computer code can become broadly “misaligned.” This surprising effect is called “emergent misalignment.” We studied why this happens. Through this

3.3K likes, 336 replies, 470 retweets

lily (xiaoqing)

@lilysun004

3 months ago

1/9: Dense SAE Latents Are Features💡, Not Bugs🐛❌! In our new paper, we examine dense (i.e. very frequently occurring) SAE latents. We find that dense latents are structured and meaningful, representing truly dense model signals.🧵

128 likes, 5 replies, 18 retweets

Achyuta Rajaram

@achyutabot

3 months ago

THATS MY GOAT HOLY SHIT

3 likes, 1 reply, 0 retweets

Achyuta Rajaram

@achyutabot

2 months ago

i did not realize you can't use a passport card for international travel 💀

5 likes, 4 replies, 0 retweets

Achyuta Rajaram

@achyutabot

2 months ago

I’m with geohot on this one :)

21 likes, 0 replies, 0 retweets

will depue (in singapore for ICLR)

@willdepue

2 months ago

do not build Infinite Jest (V), do not build the infinite AI TikTok slop machine, do not build the P-zombie AI boy/girlfriend, do not build the child-eating short-form video blackhole, do not build the human-feedback-optimized diffusion transformer porn generator. save yourselves

1.1K likes, 64 replies, 90 retweets