
Davis Blalock
@davisblalock
Research scientist + first hire @MosaicML, now @Databricks. @MIT PhD. I post about AI technical progress + sometimes the business side.
ID: 805547773944889344
http://bit.ly/3OXJbDs 04-12-2016 23:02:10
1.1K Tweets
12.12K Followers
170 Following



Thread on our newest paper: 1/n The initial motivation of our project was the "lost in the middle" phenomenon observed by Nelson Liu et al. (arxiv.org/pdf/2307.03172). They observed that models like GPT and Claude were bad at retrieving from the middle/end of the input context.











New paper📢 LLM folks have been doing supervised finetuning of their models with data from large and expensive models (e.g., Gemini Pro). However, we achieve better perf. by finetuning on samples from smaller and weaker LLMs (e.g., Flash)! w/ Mehran Kazemi, Arian Hosseini, Rishabh Agarwal, Vinh Q. Tran






