Gantavya Bhatt
@bhattgantavya
Ph.D. Student @UW, working in ML/Audio. Summer Intern @nvidia. Previously intern @amazonscience, undergrad @iitdelhi. An active photographer into Alpinism!
ID: 1011498496648548358
https://sites.google.com/view/gbhatt/ 26-06-2018 06:36:49
1.1K Tweets
659 Followers
1.1K Following
Comparing Bad Apples to Good Oranges: Aligning Large Language Models via Joint Preference Optimization 📜 arxiv.org/abs/2404.00530 w/ Ashima Suvarna🌻, Gantavya Bhatt, Violet Peng, Kai-Wei Chang, Aditya Grover (2/3)
In addition to DOVE, also (virtually) presented our work on combinatorial retrieval using submodular information measures at the ICML Conference DMLR workshop! Joint work with: Arnav Das, Sahil Verma, Lilly Kumari, Jeff Bilmes
Great summary on model merging and mode connectivity. Also adding our work on 1. Mode connectivity and backdoors: openreview.net/forum?id=SJgwz… 2. Mode connectivity and adversarial examples: arxiv.org/abs/2009.02439 3. Safety loss landscape exploration for LLMs: arxiv.org/abs/2405.17374
New paper📢 LLM folks have been doing supervised finetuning of their models with data from large and expensive models (e.g., Gemini Pro). However, we achieve better perf. by finetuning on samples from smaller and weaker LLMs (e.g., Flash)! w/ Mehran Kazemi, Arian Hosseini, Rishabh Agarwal, Vinh Q. Tran