Mihir Prabhudesai (@mihirp98) 's Twitter Profile
Mihir Prabhudesai

@mihirp98

CMU Robotics PhD | Research Intern @ Google

ID: 1037594703905136641

Link: http://mihirp1998.github.io/ | Joined: 06-09-2018 06:53:49

86 Tweets

677 Followers

372 Following

CMU Robotics Institute (@cmu_robotics) 's Twitter Profile Photo

Check it out! 🚀 "Diffusion Beats Autoregressive in Data-Constrained Settings" They show that diffusion LLMs outperform autoregressive LLMs when allowed to train for multiple epochs! #CMUrobotics Work from Mihir Prabhudesai & Mengning Wu

Mihir Prabhudesai (@mihirp98) 's Twitter Profile Photo

Extrapolating this trend to robotics, I believe that if one is doing sim2real, they should prefer Autoregressive > Diffusion (compute bottleneck). But if they are doing real-world training, then Autoregressive < Diffusion (data bottleneck). We don't empirically validate this for

Mihir Prabhudesai (@mihirp98) 's Twitter Profile Photo

We ran more experiments, with random token masking, and attention dropout in autoregressive training. Consistent with our earlier ablations, we find these augmentations still overfit quite quickly and are still quite behind diffusion models trained for 500+ epochs. Diffusion
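The two augmentations named above can be sketched as follows. This is an illustrative, hypothetical implementation in plain Python; the function names, mask token id, and rates are ours, not the paper's actual configuration.

```python
# Hypothetical sketch of random token masking and attention dropout for an
# autoregressive training batch. MASK_ID and all rates are assumptions.
import random

MASK_ID = 0  # assumed id of a special [MASK] token

def random_token_mask(tokens, mask_prob=0.15, seed=None):
    """Replace each input token with MASK_ID with probability mask_prob.
    The prediction targets stay unchanged, so the AR model must predict
    the next token from a partially corrupted prefix."""
    rng = random.Random(seed)
    return [MASK_ID if rng.random() < mask_prob else t for t in tokens]

def attention_dropout_mask(seq_len, drop_prob=0.1, seed=None):
    """Build a causal attention mask where some otherwise-allowed (j <= i)
    entries are randomly dropped, mimicking dropout at the mask level."""
    rng = random.Random(seed)
    mask = [[(j <= i) and (rng.random() >= drop_prob) for j in range(seq_len)]
            for i in range(seq_len)]
    for i in range(seq_len):
        mask[i][i] = True  # a token should always attend to itself
    return mask
```

Both corruptions act only on what the model conditions on; the left-to-right prediction objective itself is unchanged, which may be why they help less than a diffusion objective.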

Mihir Prabhudesai (@mihirp98) 's Twitter Profile Photo

We ran more experiments to better understand “why” diffusion models do better in data-constrained settings than autoregressive. Our findings support the hypothesis that diffusion models benefit from learning over multiple token orderings, which contributes to their robustness and
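The "multiple token orderings" hypothesis can be made concrete with a small sketch: a left-to-right AR objective always conditions position i on positions 0..i-1, while a masked (diffusion-style) objective conditions each masked position on a random subset of the others. The function names below are ours, purely for illustration.

```python
# Illustrative contrast between the conditioning sets of a left-to-right AR
# objective and a masked, diffusion-style objective. Not the paper's code.
import random

def ar_conditioning(seq_len):
    """Left-to-right AR: token i is always predicted from {0..i-1}."""
    return {i: set(range(i)) for i in range(seq_len)}

def masked_conditioning(seq_len, mask_prob=0.5, seed=None):
    """Masked objective: each masked token is predicted from all unmasked
    positions -- effectively a different 'ordering' on every draw."""
    rng = random.Random(seed)
    masked = {i for i in range(seq_len) if rng.random() < mask_prob}
    visible = set(range(seq_len)) - masked
    return {i: set(visible) for i in masked}
```

Over many training draws, the masked objective covers many distinct (context, target) pairs per position, while the AR objective repeats the same single ordering each epoch.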

Lucas Beyer (bl16) (@giffmana) 's Twitter Profile Photo

Amazing! Truly open review, through which we all gained more insights. I love it! Result: in the multi-epoch setting, making AR learn multiple orderings ~closes the gap to diffusion, explaining much of the difference. How the truly open review happened (from my vague memory): Mihir

Mihir Prabhudesai (@mihirp98) 's Twitter Profile Photo

In RENT, we showed LLMs can improve without access to answers, by maximizing confidence. In this work, we go further: LLMs can improve without even having the questions. Using self-play, one LLM learns to ask challenging questions, while the other LLM uses confidence to solve them
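A minimal sketch of the confidence signal described above: reward the model for peaked (low-entropy) per-token output distributions, with no reference answer needed. The exact reward in RENT may differ; this is only an illustration, and the function names are ours.

```python
# Hedged sketch of a confidence reward: negative mean entropy of the model's
# per-token output distributions. Illustrative only, not RENT's exact reward.
import math

def token_entropy(probs):
    """Shannon entropy of one token's probability distribution."""
    return -sum(p * math.log(p) for p in probs if p > 0.0)

def confidence_reward(step_distributions):
    """Higher when the model's per-token distributions are peaked
    (low entropy); requires no ground-truth answer."""
    if not step_distributions:
        return 0.0
    avg_entropy = sum(token_entropy(p) for p in step_distributions) / len(step_distributions)
    return -avg_entropy  # maximizing confidence = minimizing entropy
```

In a self-play setup of the kind the tweet describes, one could use this reward for the solver while the question-asker is rewarded for producing problems near the solver's confidence frontier.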

Mihir Prabhudesai (@mihirp98) 's Twitter Profile Photo

Nice work -- great to see some of the core findings from our work being validated :) Our original paper post - x.com/mihirp98/statu… Just to clarify the points raised as issues about our work: 1 - Missing scalar term in the loss – This was a typo during paper writing,

Niklas Muennighoff (@muennighoff) 's Twitter Profile Photo

Excited to see recent works push the data-constrained frontier via diffusion LMs! Encoder-Decoders can also repeat a lot more, as T5 showed in 2019 - back to Encoder-Decoders? =D

Mihir Prabhudesai (@mihirp98) 's Twitter Profile Photo

We do not use Eq. 1 - we use Eq. 2. The Eq. 1 reference was a minor typo (missing 1/r term) fixed within 7 days (July 26) of release. 

The authors saw this fix on arxiv but still cite our older versions for some reason.

Using Eq. 1 makes diffusion outperform AR even at 1 epoch
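For intuition on why the missing 1/r term matters: in masked-diffusion objectives of this kind, the cross-entropy over tokens masked at ratio r is commonly reweighted by 1/r so that different mask ratios contribute comparably. The sketch below only illustrates that weighting; the paper's actual equations (Eq. 1 vs. Eq. 2) should be consulted for the real form.

```python
# Hedged sketch of the 1/r reweighting in a masked-diffusion loss.
# Illustrative only; not the paper's Eq. 1 or Eq. 2.
def weighted_masked_loss(per_token_ce, mask_ratio):
    """per_token_ce: cross-entropy values for the masked tokens only.
    Dividing by the mask ratio upweights low-ratio (few-mask) draws."""
    assert 0.0 < mask_ratio <= 1.0
    mean_ce = sum(per_token_ce) / len(per_token_ce)
    return mean_ce / mask_ratio  # the 1/r term
```

Dropping the 1/r factor changes the relative weight of different mask ratios, which is why its presence or absence can flip loss-level comparisons.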
Sachin Goyal (@goyalsachin007) 's Twitter Profile Photo

I myself got confused trying to understand what the serious flaw was, given the new paper had similar takeaways. There is a limit to clickbait stuff please.

Lambda (@lambdaapi) 's Twitter Profile Photo

AI that sees, hears, and reasons: superintelligence starts here.

#LambdaResearch invites all researchers, engineers and AI enthusiasts to participate in the Grand Challenge on Multimodal Superintelligence.

Join us and receive up to $20,000 compute credit per team to build the
Jason Liu (@jasonjzliu) 's Twitter Profile Photo

Ever wish a robot could just move to any goal in any environment—avoiding all collisions and reacting in real time? 🚀 Excited to share our #CoRL2025 paper, Deep Reactive Policy (DRP), a learning-based motion planner that navigates complex scenes with moving obstacles—directly

Jiahui(Jim) Yang (@jiahui_yang6709) 's Twitter Profile Photo

After another wonderful year of neural motion planning research, we are excited to report a major upgrade on our pipeline 🎉 Introducing Deep Reactive Policy (DRP) 🚀 — our #CoRL2025 paper that extends our prior work Neural MP with both generalizability and reactivity while

Homanga Bharadhwaj (@mangahomanga) 's Twitter Profile Photo

I'll be joining the faculty at Johns Hopkins University late next year as a tenure-track assistant professor in JHU Computer Science. Looking for PhD students to join me tackling fun problems in robot manipulation, learning from human data, understanding+predicting physical interactions, and beyond!

Rohan Choudhury (@rchoudhury997) 's Twitter Profile Photo

Excited to release our new preprint - we introduce Adaptive Patch Transformers (APT), a method to speed up vision transformers by using multiple different patch sizes within the same image!
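The core accounting behind mixed patch sizes can be sketched in a few lines: covering the same image with larger patches in some regions yields fewer tokens than a uniform small-patch grid. The region-selection rule here (a fixed list of tiles) is hypothetical, not APT's actual method.

```python
# Illustrative token-count comparison for uniform vs. mixed patch sizes.
# The tiling scheme is an assumption, not APT's real patch-selection rule.
def token_count_uniform(h, w, patch):
    """Tokens for a uniform patch grid over an h x w image."""
    return (h // patch) * (w // patch)

def token_count_mixed(regions):
    """regions: list of (region_h, region_w, patch_size) tiles that together
    cover the image; each tile is patchified at its own size."""
    return sum((rh // p) * (rw // p) for rh, rw, p in regions)
```

For example, a 32x32 image at patch size 8 gives 16 tokens, while patchifying half of it at size 16 and half at size 8 gives 10 — fewer tokens to push through the transformer, hence the speedup.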