๐Ÿš€ Rocket Cannot Be Resampled (@rocketalignment) 's Twitter Profile
๐Ÿš€ Rocket Cannot Be Resampled

@rocketalignment

Writing about AI @theinformation via @Tarbell_Fellows, previously Community Manager @MATSProgram. Signal: (530) 400-4184

ID: 1778838345042350080

calendar_today12-04-2024 17:31:34

474 Tweet

525 Followers

600 Following

๐Ÿš€ Rocket Cannot Be Resampled (@rocketalignment) 's Twitter Profile Photo

Pack it up, everyone. They finally did it. They took the last AI pun for a newsletter name. Transformer, Gradient Updates, 3 Shot Learning (rip), now Hidden Layer. Everyone can go home. They did it. None left. It's done.

Eleos AI Research (@eleosai) 's Twitter Profile Photo

Despite flaws with model self-reports, we interviewed because: 1. Signal: catch red flags for further investigation 2. Scale: as models advance, these methods might yield better evidence 3. Precedent: if and when AIs matter morally, we should ask them how they want to be treated

Despite flaws with model self-reports, we interviewed because:

1. Signal: catch red flags for further investigation
2. Scale: as models advance, these methods might yield better evidence
3. Precedent: if and when AIs matter morally, we should ask them how they want to be treated
Junior Rojas (@junior_rojas_d) 's Twitter Profile Photo

I've been experimenting with attention mechanisms to design locomotion controllers that adapt to different shapes, this is the same controller running on two different bodies github.com/juniorrojas/moโ€ฆ paper coming soon ๐Ÿ‘€

Alex Turner (@turn_trout) 's Twitter Profile Photo

Thought real machine unlearning was impossible? We show that distilling a conventionally โ€œunlearnedโ€ model creates a model resistant to relearning attacks. ๐ƒ๐ข๐ฌ๐ญ๐ข๐ฅ๐ฅ๐š๐ญ๐ข๐จ๐ง ๐ฆ๐š๐ค๐ž๐ฌ ๐ฎ๐ง๐ฅ๐ž๐š๐ซ๐ง๐ข๐ง๐  ๐ซ๐ž๐š๐ฅ.

Thought real machine unlearning was impossible? We show that distilling a conventionally โ€œunlearnedโ€ model creates a model resistant to relearning attacks. ๐ƒ๐ข๐ฌ๐ญ๐ข๐ฅ๐ฅ๐š๐ญ๐ข๐จ๐ง ๐ฆ๐š๐ค๐ž๐ฌ ๐ฎ๐ง๐ฅ๐ž๐š๐ซ๐ง๐ข๐ง๐  ๐ซ๐ž๐š๐ฅ.