
Nimit Kalra
@qw3rtman
research @haizelabs, aligning rewards. ex @citadel @utaustin
$ pip install verdict
ID: 385428300
https://nimit.io/ 05-10-2011 13:50:20
99 Tweet
781 Followers
2,2K Following

Discussing "Mind the Gap" tonight at Haize Labs's NYC AI Reading Group with Leonard Tang and will brown. Authors study self-improvement through the "Generation-Verification Gap" (model's verification ability over its own generations) and find that this capability log scales with
