
Avner May
@avnermay
Staff Research Scientist at together.ai.
Formerly research scientist at Google, postdoc at Stanford, and PhD student at Columbia.
ID: 41314685
https://avnermay.github.io/ 20-05-2009 07:16:11
34 Tweet
249 Followers
238 Following


This was joint work between our team (Avner May, Max Ryabinin) and our awesome collaborators at CMU and Yandex: chen zhuoming , Ruslan Svirschevski, Yuhsun Huang, Zhihao Jia, and Beidi Chen.






The Mamba in the Llama: arxiv.org/abs//2408.15237 RNN are neat. Here's a video describing how to make them work really well with little money: youtube.com/watch?v=A5ff8h… (by Junxiong Wang and Daniele Paliotta )


Excited to share our latest work, where we show how to distill from a Llama model into a Mamba hybrid, and how to make speculative decoding work with these models! With Junxiong Wang, Daniele Paliotta, Sasha Rush, Tri Dao.





AI at Meta 🙌 We love that Llama has gone multimodal! We're excited to partner with AI at Meta to offer free access to the Llama 3.2 11B vision model for developers. Can't wait to see what everyone builds! Try now with our Llama-Vision-Free model endpoint. Sign up here:


