Miles Turpin
@milesaturpin
Language model alignment @nyuniversity
ID:865609028579213312
http://milesturp.in/about 19-05-2017 16:44:09
364 Tweets
957 Followers
1,3K Following
π¨π Following up on 'LMs Don't Always Say What They Think', Miles Turpin et al. now have an intervention that dramatically reduces the problem! ππ¨
It's not a perfect solution, but it's a simple method with few assumptions and it generalizes *much* better than I'd expected.