Joel Jang (@jang_yoel)'s Twitter Profile
Joel Jang

@jang_yoel

Research Scientist @nvidiaai GEAR Lab. CS PhD student @uwcse.

ID: 1370193064887681029

Link: https://joeljang.github.io/ · Joined: 12-03-2021 02:01:04

324 Tweets

1.1K Followers

366 Following

Soroush Nasiriany (@snasiriany)'s Twitter Profile Photo

It’s not a matter of if, it’s a matter of when: video models and world models are going to be a central tool for building robot foundation models.

The Humanoid Hub (@thehumanoidhub)'s Twitter Profile Photo

NVIDIA has published a paper on DREAMGEN – a powerful 4-step pipeline for generating synthetic data for humanoids that enables task and environment generalization.
- Step 1: Fine-tune a video generation model using a small number of human teleoperation videos
- Step 2: Prompt …
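
The tweet is cut off after step 2, but the GR00T Dreams post further down this feed names the remaining steps (extract actions with an IDM, then train a visuomotor policy), so the full loop looks roughly like the sketch below. Every function here is a hypothetical placeholder, not the released implementation.

```python
def dreamgen_pipeline(teleop_videos, video_model, idm, prompts, train_policy):
    """Toy outline of the four-step DreamGen-style pipeline described above."""
    # Step 1: fine-tune the video generation model on a small teleop set
    video_model.finetune(teleop_videos)
    # Step 2: prompt it to generate novel robot videos ("dreams")
    dreams = [video_model.generate(p) for p in prompts]
    # Step 3: recover pseudo-action labels with an inverse dynamics model (IDM)
    trajectories = [(dream, idm.extract_actions(dream)) for dream in dreams]
    # Step 4: train a visuomotor policy on the synthetic trajectories
    return train_policy(trajectories)
```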

Brett Adcock (@adcock_brett)'s Twitter Profile Photo

Nvidia also announced DreamGen, a new engine that scales robot learning with digital dreams. It produces large volumes of photorealistic robot videos (using video models) paired with motor action labels, and unlocks generalization to new environments.

Ruijie Zheng (@ruijie_zheng12)'s Twitter Profile Photo

Representation also matters for VLA models! Introducing FLARE: Robot Learning with Implicit World Modeling. With a future latent alignment objective, FLARE significantly improves policy performance on multitask imitation learning and unlocks learning from egocentric human videos.

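FLARE's exact recipe is in the paper, but the gist of a future latent alignment objective can be sketched in a few lines: the policy predicts latents of future observations, and an auxiliary loss pulls them toward the embeddings a frozen encoder assigns to the real future frames. The API below (a policy returning both actions and predicted future latents) is an assumption for illustration, not FLARE's actual code.

```python
import torch
import torch.nn.functional as F

def future_latent_alignment_loss(policy, target_encoder, obs, actions,
                                 future_obs, lam=0.2):
    # Hypothetical policy API: returns predicted actions plus predicted
    # latents for the future observations.
    pred_actions, pred_future_latents = policy(obs)
    with torch.no_grad():
        # Frozen target encoder embeds the real future frames.
        target_latents = target_encoder(future_obs)
    bc = F.mse_loss(pred_actions, actions)  # standard imitation term
    # Alignment term: pull predicted future latents toward the real ones.
    align = 1.0 - F.cosine_similarity(
        pred_future_latents, target_latents, dim=-1).mean()
    return bc + lam * align
```
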
Joel Jang (@jang_yoel)'s Twitter Profile Photo

Giving a talk about GR00T N1, GR00T N1.5, and GR00T Dreams at NVIDIA GTC Paris, 06.11, 2PM–2:45PM CEST. If you are at Vivatech in Paris, please stop by the "An Introduction to Humanoid Robotics" session!

Yiyang Zhou (@aiyiyangz)'s Twitter Profile Photo

🔥 ReAgent-V Released! 🔥

A unified video framework with reflection and reward-driven optimization.

✨ Real-time self-correction.
✨ Triple-view reflection.
✨ Auto-selects high-reward samples for training.
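
Read as a loop, the recipe above is roughly: predict, score with a reward model, reflect on low-reward outputs, and keep only high-reward samples for later training. A toy sketch, with `model`, `reward_fn`, and `reflect` as hypothetical stand-ins for the released components:

```python
def collect_training_samples(model, reward_fn, reflect, videos, threshold=0.8):
    """Reward-driven sample selection with one self-correction pass."""
    kept = []
    for video in videos:
        answer = model(video)                       # initial prediction
        reward = reward_fn(video, answer)
        if reward < threshold:                      # reflect and retry once
            answer = reflect(model, video, answer)
            reward = reward_fn(video, answer)
        if reward >= threshold:                     # auto-select high-reward samples
            kept.append((video, answer, reward))
    return kept
```
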
Chris Paxton (@chris_j_paxton)'s Twitter Profile Photo

Assuming that we need ~2 trillion tokens to get to a robot GPT, how can we get there? I went through a few scenarios, looking at how we can combine simulation data and human video data, and at the size of existing robot fleets.

Some assumptions:
- We probably need some real …
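
The thread is cut off here, but the style of back-of-envelope it describes is easy to reproduce. Every number below is a placeholder assumption, not a figure from the original thread:

```python
TARGET_TOKENS = 2e12      # ~2 trillion tokens for a "robot GPT"
TOKENS_PER_HOUR = 1e6     # assumed tokens per robot-hour of collected data
ROBOTS = 10_000           # assumed fleet size
HOURS_PER_DAY = 8         # assumed daily operating time per robot

hours_needed = TARGET_TOKENS / TOKENS_PER_HOUR    # 2e6 robot-hours
days = hours_needed / (ROBOTS * HOURS_PER_DAY)    # 25 days for this fleet
print(f"{hours_needed:.1e} robot-hours ≈ {days:.0f} days of fleet time")
# Gaps at smaller fleet sizes are why the thread mixes in simulation
# and human video data.
```
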
Qinsheng Zhang (@qsh_zh)'s Twitter Profile Photo

🚀 Introducing Cosmos-Predict2! Our most powerful open video foundation model for Physical AI. Cosmos-Predict2 significantly improves upon Predict1 in visual quality, prompt alignment, and motion dynamics—outperforming popular open-source video foundation models. It’s openly …

youliang (@youliangtan)'s Twitter Profile Photo

How do we improve VLA generalization? 🤔 Last week we upgraded #NVIDIA GR00T N1.5 with minor VLM tweaks, FLARE, and richer data mixtures (DreamGen, etc.) ✨. N1.5 yields better language following — post-trained on unseen Unitree G1 with 1K trajectories, it follows commands on …

Joel Jang (@jang_yoel)'s Twitter Profile Photo

🚀 GR00T Dreams code is live! NVIDIA GEAR Lab's open-source solution for robotics data via video world models. Fine-tune on any robot, generate 'dreams', extract actions with IDM, and train visuomotor policies with LeRobot datasets (GR00T N1.5, SmolVLA). github.com/NVIDIA/GR00T-D…
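
The IDM step is the glue between dreams and policies: a small model that maps consecutive video frames to the action connecting them, so generated videos can be turned into (observation, action) training pairs. A toy version for illustration only, not the GR00T Dreams implementation:

```python
import torch
import torch.nn as nn

class InverseDynamicsModel(nn.Module):
    """Predicts the action linking two consecutive frames (toy sketch)."""

    def __init__(self, action_dim, feat_dim=256):
        super().__init__()
        self.encoder = nn.Sequential(            # tiny per-frame encoder
            nn.Conv2d(3, 32, 8, stride=4), nn.ReLU(),
            nn.Conv2d(32, 64, 4, stride=2), nn.ReLU(),
            nn.AdaptiveAvgPool2d(1), nn.Flatten(),
        )
        self.head = nn.Sequential(
            nn.Linear(64 * 2, feat_dim), nn.ReLU(),
            nn.Linear(feat_dim, action_dim),
        )

    def forward(self, frame_t, frame_t1):
        z = torch.cat([self.encoder(frame_t), self.encoder(frame_t1)], dim=-1)
        return self.head(z)

def label_dream(idm, frames):
    """Pseudo-label a generated video; frames has shape (T, 3, H, W)."""
    return torch.stack([
        idm(frames[t:t + 1], frames[t + 1:t + 2]).squeeze(0)
        for t in range(len(frames) - 1)
    ])
```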

Joel Jang (@jang_yoel)'s Twitter Profile Photo

Check out Cosmos-Predict2, a new SOTA video world model trained specifically for Physical AI (powering GR00T Dreams & DreamGen)!

AgiBot World (@agibotworld)'s Twitter Profile Photo

Compete for a $560,000 Prize Pool at IROS 2025 AgiBot World Challenge! 💰
The AgiBot World Challenge – Manipulation Track is LIVE! Hosted by @AgiBot and @OpenDriveLab at #IROS2025.
🚀 Challenge: Tackle 10 complex Sim2Real manipulation tasks.
🛠️ Resources: Access a unique …
Jim Fan (@drjimfan)'s Twitter Profile Photo

I've been a bit quiet on X recently. The past year has been a transformational experience. Grok-4 and Kimi K2 are awesome, but the world of robotics is a wondrous wild west. It feels like NLP in 2018 when GPT-1 was published, along with BERT and a thousand other flowers that …

The Humanoid Hub (@thehumanoidhub)'s Twitter Profile Photo

A humanoid robot policy trained solely on synthetic data generated by a world model. Research Scientist Joel Jang presents NVIDIA's DreamGen pipeline:
⦿ Post-train the world model Cosmos-Predict2 with a small set of real teleoperation demos.
⦿ Prompt the world model to …

Jim Fan (@drjimfan)'s Twitter Profile Photo

World modeling for robotics is incredibly hard because (1) control of humanoid robots & 5-finger hands is wayyy harder than ⬆️⬅️⬇️➡️ in games (Genie 3); and (2) object interaction is much more diverse than FSD, which needs to *avoid* coming into contact. Our GR00T Dreams work was …