Anush Kini
@ability_guy
Visiting Researcher @UCL | Prev @MSFTResearch
ID: 1711664286408327168
https://abilityguy.github.io/ 10-10-2023 08:45:48
14 Tweet
43 Followers
199 Following
You can find me and Sayak Ray Chowdhury presenting our work today at Poster Session 3 (Poster #913). Come say hello! #ICML2024
My First Blog! abinezer.github.io/posts/blog-pos… Would love feedback and comments! Thanks to my friends Anush Kini and Roshni Poddar for reviewing!
(1/4) Typical LLM post-training mechanisms have a hard time learning models that can produce diverse responses. To fix this we introduce DQO (Diversity Quality Optimization), a method for post-training LLMs to generate diverse high-quality