
Allan Zhou
@allanzhou17
AI & robotics research @GoogleDeepMind | Prev: PhD @Stanford, @Google Brain
ID: 1505760644758315011
http://bland.website 21-03-2022 04:18:41
252 Tweets
1.1K Followers
590 Following

Are you training LLMs and managing your training data via a DFS? Do you spend a lot of time writing data wrangling/mixing scripts? We just posted a preprint on Mixtera, our data plane for LLM/VLM training: github.com/eth-easl/mixte… arxiv.org/abs/2502.19790



I've just paid off on this bet to Alex Tabarrok, who won.



We are organizing a workshop on Building Physically Plausible World Models at ICML Conference 2025! We have a great lineup of speakers, and are inviting you to submit your papers with a May 10 deadline. Website: physical-world-modeling.github.io


Check out our online data selection alg ADO at ICLR 2025! And take a look at this blog post by Yiding Jiang and Allan Zhou summarizing the key ideas: bland.website/notes/ado/





For the last 1.5 days, we've been demoing Gemini Robotics *live* to guests at Google I/O! The robot can converse with you while solving a large range of dexterous tasks. AI + robotics is so exciting, and we're cooking up a storm at Google DeepMind. Can't wait to share more.

Say ahoy to SAILOR ⛵: a new paradigm of *learning to search* from demonstrations, enabling test-time reasoning about how to recover from mistakes w/o any additional human feedback! SAILOR ⛵ outperforms Diffusion Policies trained via behavioral cloning on 5-10x data!


