Allan Zhou (@allanzhou17) Twitter Tweets • TwiCopy

Maximilian Böther

6 months ago

📊 Are you training LLMs and manage your training data via a DFS? Do you spend a lot of time writing data wrangling/mixing scripts? ⌛ We just posted a preprint on Mixtera, our data plane for LLM/VLM training🎉 🔗 github.com/eth-easl/mixte… 🔗 arxiv.org/abs/2502.19790 Read more👇

thumb_up_off_alt7

chat_bubble_outline1

repeat4

shareShare

Chris Paxton

@chris_j_paxton

6 months ago

This on it's own is a super useful capability for roboticists. My top request is and always has been just for a grasping model that *works*

thumb_up_off_alt53

chat_bubble_outline2

repeat4

shareShare

Colin Fraser

@colin_fraser

6 months ago

re-upping this. I think it contains many important truths and mysteries.

thumb_up_off_alt273

chat_bubble_outline18

repeat18

shareShare

Robin Hanson

@robinhanson

6 months ago

I've just paid off on this bet to Alex Tabarrok, who won.

thumb_up_off_alt745

chat_bubble_outline16

repeat29

shareShare

Hongyu Ren

@ren_hongyu

5 months ago

o3-mini-high helps accelerate scientific discovery 💙 arxiv.org/abs/2503.23758

thumb_up_off_alt1,1K

chat_bubble_outline41

repeat146

shareShare

Norman Di Palo

@normandipalo

5 months ago

Last month we announced Gemini Robotics (GR) and Gemini Robotics-ER (GR-ER). GR-ER is a powerful VLM specialised for spatial understanding, including detecting object poses in 2D/3D, pointing, and even *predicting grasp poses*. Take a look at this demo. Details below. 🧵

thumb_up_off_alt205

chat_bubble_outline8

repeat27

shareShare

Sean Kirmani

@seankirmani

5 months ago

🌎🌏🌍 We are organizing a workshop on Building Physically Plausible World Models at ICML Conference 2025! We have a great lineup of speakers, and are inviting you to submit your papers with a May 10 deadline. Website: physical-world-modeling.github.io

🌎🌏🌍 We are organizing a workshop on Building Physically Plausible World Models at <a href="/icmlconf/">ICML Conference</a> 2025!

We have a great lineup of speakers, and are inviting you to submit your papers with a May 10 deadline.

Website: physical-world-modeling.github.io

thumb_up_off_alt101

chat_bubble_outline1

repeat22

shareShare

Sadhika Malladi

@sadhikamalladi

5 months ago

Check out our online data selection alg ADO at ICLR 2025! And take a look at this blog post by Yiding Jiang and Allan Zhou summarizing the key ideas: bland.website/notes/ado/

thumb_up_off_alt44

chat_bubble_outline0

repeat10

shareShare

Yiding Jiang

@yidingjiang

4 months ago

I will be at #ICLR2025 until Monday. Looking forward to meeting old and new friends. If you want to chat about generalization / RL / curriculum learning / compression & algorithmic info theory (or anything really 😬), please DM me! Otherwise, I will be presenting 2 papers:

thumb_up_off_alt83

chat_bubble_outline2

repeat6

shareShare

Keegan Harris

@keegan_w_harris

4 months ago

Back in March, I wore a head-mounted camera for a week straight and fine-tuned ChatGPT on the resulting data. Here's what happened (1/6) arxiv.org/pdf/2504.03857

thumb_up_off_alt21

chat_bubble_outline2

repeat4

shareShare

fofr

@fofrai

3 months ago

NO WAY. It did it. And, was that, actually funny? Prompt: > a man doing stand up comedy in a small venue tells a joke (include the joke in the dialogue)

thumb_up_off_alt8,8K

chat_bubble_outline307

repeat633

shareShare

Allan Zhou

@allanzhou17

3 months ago

If you haven't already seen our Gemini Robotics demo at #GoogleIO, today's your chance! You'll give commands to our robots and watch them do the work, using only your voice. If you're not at IO, check out: youtu.be/BKM3vohmED8?fe…

thumb_up_off_alt11

chat_bubble_outline0

repeat0

shareShare

Sudeep Dasari

@sudeepdasari

3 months ago

For the last 1.5 days, we've been demoing Gemini Robotics *live* to guests at Google I/O! The robot can converse with you while solving a large range of dexterous tasks 🤖 AI + robotics is so exciting, and we're cooking up a storm Google DeepMind. Can't wait to share more 🚀

thumb_up_off_alt99

chat_bubble_outline2

repeat11

shareShare

Gokul Swamy

@g_k_swamy

3 months ago

Say ahoy to 𝚂𝙰𝙸𝙻𝙾𝚁⛵: a new paradigm of *learning to search* from demonstrations, enabling test-time reasoning about how to recover from mistakes w/o any additional human feedback! 𝚂𝙰𝙸𝙻𝙾𝚁 ⛵ out-performs Diffusion Policies trained via behavioral cloning on 5-10x data!

thumb_up_off_alt247

chat_bubble_outline10

repeat64

shareShare

Keerthana Gopalakrishnan

@keerthanpg

3 months ago

Given the progress in robot mobility & locomotion lately, we are throwing a robot fashion show at CoRL 2025! 😂 Deadline to apply is July 15th. This is going to be weird and lengendary - at the pareto frontier of art and tech, please consider submitting! corl.org/contributions/…

thumb_up_off_alt66

chat_bubble_outline2

repeat7

shareShare

Yiding Jiang

@yidingjiang

2 months ago

A mental model I find useful: all data acquisition (web scrapes, synthetic data, RL rollouts, etc.) is really an exploration problem 🔍. This perspective has some interesting implications for where AI is heading. Wrote down some thoughts: yidingjiang.github.io/blog/post/expl…

thumb_up_off_alt412

chat_bubble_outline5

repeat56

shareShare

Lucas Wang@CoRL

@wlt5678

2 months ago

Gemini Robotics zero shot picks a dextrous hand: No prior demos, not even videos. It recognized, failed to grasp (slippery surface), retried with new angles, got help, nailed the pick, adjusted post-pick. Mad respect to DeepMind team. Now I really worry about human labor 😅

thumb_up_off_alt140

chat_bubble_outline4

repeat13

shareShare