Clément Dumas (at ICLR)
@butanium_
MSc at @ENS_ParisSaclay prev research intern at DLAB @EPFL
MATS Winter 2025 Scholar w/ Neel Nanda
AI safety research / improv theater
ID: 1071099307045081088
https://butanium.github.io/ 07-12-2018 17:49:09
592 Tweet
356 Followers
439 Following
Another cool paper by Andrew Lee 🚀
With Clément Dumas and Neel Nanda we've just published a post on model diffing that extends our previous paper. Rather than trying to reverse-engineer the full fine-tuned model, model diffing focuses on understanding what makes it different from its base model internally.