Bowen Li (@bw_li1024)'s Twitter Profile
Bowen Li

@bw_li1024

PhD student @CMU_Robotics working on Reasoning, Planning, and Robotics

ID: 1379225463772512259

Link: http://jaraxxus-me.github.io · Joined: 06-04-2021 00:12:40

274 Tweets

818 Followers

645 Following

Jiafei Duan (@djiafei)'s Twitter Profile Photo

Humans use pointing to communicate plans intuitively. Compared to language, pointing gives more precise guidance to robot behaviors. Can we teach a robot how to point like humans? Introducing RoboPoint 🤖👉, an open-source VLM instruction-tuned to point. Check out our new work:

Tom Silver (@tomssilver)'s Twitter Profile Photo

Happy to share a new preprint: "Coloring Between the Lines: Personalization in the Null Space of Planning Constraints" w/ Rajat Kumar Jenamani, Ziang Liu, Ben Dodson, and Tapomayukh "Tapo" Bhattacharjee. TLDR: We propose a method for continual, flexible, active, and safe robot personalization. Links 👇

Rajat Kumar Jenamani (@rkjenamani)'s Twitter Profile Photo

Excited to share our work on continual, flexible, active, and safe robot personalization w/ Tom Silver, Ziang Liu, Ben Dodson & Tapomayukh "Tapo" Bhattacharjee. Also: Tom Silver is starting a lab at Princeton!! I HIGHLY recommend joining – thoughtful, kind, and an absolute joy to work with!

Yuheng Qiu (@qiuyuhengqiu)'s Twitter Profile Photo

🔥 Best Paper Award at #ICRA2025
Thrilled to share that our paper MAC-VO has been awarded the Best Conference Paper Award and the Best Paper Award on Robot Perception!

Check our project: mac-vo.github.io

Guanya Shi (@guanyashi)'s Twitter Profile Photo

System ID for legged robots is hard: (1) the dynamics are discontinuous, and (2) there are many parameters to identify, and they are hard to "excite." SPI-Active is a general tool for legged robot system ID. Key ideas: (1) massively parallel sampling-based optimization, (2) structured parameter space,
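
The tweet doesn't spell out the mechanism, but here is a minimal, hypothetical sketch of one standard instance of "massively parallel sampling-based optimization" for system ID: a cross-entropy-style search over simulation parameters with batched parallel rollouts. All names (`simulate_batch`, `real_traj`, etc.) are placeholders, not the SPI-Active API.

```python
# Hedged sketch: CEM-style sampling-based system ID with parallel rollouts.
import numpy as np

def sampling_based_sysid(simulate_batch, real_traj, dim,
                         iters=20, pop=1024, elite_frac=0.1):
    """Search for sim parameters whose rollouts match a real trajectory."""
    mu, sigma = np.zeros(dim), np.ones(dim)
    n_elite = int(pop * elite_frac)
    for _ in range(iters):
        # Sample a large population of candidate parameters (parallelizable).
        thetas = mu + sigma * np.random.randn(pop, dim)
        # Roll out all candidates in simulation at once: (pop, T, obs_dim).
        sim_trajs = simulate_batch(thetas)
        # Score each candidate by trajectory-matching error vs. the real data.
        errs = ((sim_trajs - real_traj) ** 2).mean(axis=(1, 2))
        # Refit the sampling distribution around the best-matching candidates.
        elites = thetas[np.argsort(errs)[:n_elite]]
        mu, sigma = elites.mean(axis=0), elites.std(axis=0) + 1e-6
    return mu
```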

ARC Prize (@arcprize)'s Twitter Profile Photo

Claude Sonnet 4 on ARC-AGI Semi Private Eval

Base
* ARC-AGI-1: 23%, $0.08/task
* ARC-AGI-2: 1.2%, $0.12/task

Thinking 16K
* ARC-AGI-1: 40%, $0.36/task
* ARC-AGI-2: 5.9%, $0.48/task

Sonnet 4 sets new SOTA (5.9%) on ARC-AGI-2

Donglai Xiang (@donglaixiang)'s Twitter Profile Photo

🚨 Excited to announce the 1st Workshop on Vision Meets Physics at @CVPR2025!

Join us on June 12 for a full-day event exploring the synergy between physical simulation & computer vision to bridge the gap between the virtual and physical worlds.

URL: tinyurl.com/vis-phys

Simeng (Sophia) Han (@hansineng)'s Twitter Profile Photo

Zero fluff, maximum insight ✨.
Let's see what LLMs are really made of, with 🧠 Brainteasers.

We're not grading answers 🔒. We're grading thinking 💭.
Brute force? Creative leap? False confession? 🤔

Instead of asking "Did the model get the right answer?",
we ask: "Did it…

Changyi Lin (@changyi_lin1)'s Twitter Profile Photo

Introducing LocoTouch: Quadrupedal robots equipped with tactile sensing can now transport unsecured objects – no mounts, no straps. The tactile policy transfers zero-shot from sim to real. Core Task-Agnostic Features: 1. High-fidelity contact simulation for distributed tactile

Yilun Du (@du_yilun)'s Twitter Profile Photo

Excited to share work on using classical search approaches to scale inference in diffusion models!

We show how global graph search algorithms (BFS, DFS) and local search can be used to improve generation performance across domains such as image generation, planning, and RL!
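
For a concrete picture of what search over diffusion sampling can look like, here is a minimal hypothetical sketch of a BFS/beam-style variant: branch several stochastic denoising continuations per candidate and let a verifier prune them. `denoise_step` (one stochastic reverse-diffusion step) and `verifier` (scores a partial generation) are assumed stand-ins, not the paper's actual interface.

```python
# Hedged sketch: beam search over stochastic denoising trajectories.
import torch

def beam_search_diffusion(denoise_step, verifier, x_T, n_steps,
                          beam=4, branch=4):
    candidates = [x_T.clone() for _ in range(beam)]
    for t in reversed(range(n_steps)):
        # Expand: several stochastic denoising continuations per candidate.
        expanded = [denoise_step(x, t) for x in candidates for _ in range(branch)]
        # Prune: keep the `beam` candidates the verifier scores highest.
        scores = torch.tensor([verifier(x) for x in expanded])
        keep = scores.topk(beam).indices
        candidates = [expanded[i] for i in keep]
    # topk sorts descending, so the first candidate is the best-scored sample.
    return candidates[0]
```
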
Wenli Xiao (@_wenlixiao)'s Twitter Profile Photo

Tired of watching fancy humanoid dancing? Can they just do some daily useful tasks like: "Pass me a bottle of water 🍺"? 🤔 Turns out it's nontrivial to stabilize whole-body manipulation and locomotion at the same time. We basically want our humanoid to be stable as a camera

Tianyuan Zhang (@tianyuanzhang99)'s Twitter Profile Photo

Bored of linear recurrent memories (e.g., linear attention) and want a scalable, nonlinear alternative? Our new paper "Test-Time Training Done Right" proposes LaCT (Large Chunk Test-Time Training) – a highly efficient, massively scalable nonlinear memory with: 💡 Pure PyTorch
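
As a rough illustration of the "large chunk test-time training" idea (my reading of the tweet, not the authors' code): keep a small nonlinear fast-weight network as the memory, and update it with one gradient step per large chunk of tokens instead of per token. Shapes, the MSE write rule, and all names here are assumptions.

```python
# Hedged sketch: a nonlinear fast-weight memory updated once per large chunk.
import torch
import torch.nn as nn
import torch.nn.functional as F

class ChunkTTTMemory(nn.Module):
    def __init__(self, dim, hidden=256, lr=0.1, chunk=2048):
        super().__init__()
        self.mem = nn.Sequential(nn.Linear(dim, hidden), nn.SiLU(),
                                 nn.Linear(hidden, dim))
        self.lr, self.chunk = lr, chunk

    def forward(self, k, v, q):
        # k, v, q: (seq_len, dim). Read each chunk with the current fast
        # weights, then "write" the chunk via one big gradient step.
        outs = []
        for s in range(0, k.size(0), self.chunk):
            ks, vs, qs = k[s:s+self.chunk], v[s:s+self.chunk], q[s:s+self.chunk]
            outs.append(self.mem(qs))                  # read before update (causal)
            write_loss = F.mse_loss(self.mem(ks), vs)  # how well memory stores chunk
            grads = torch.autograd.grad(write_loss, list(self.mem.parameters()))
            with torch.no_grad():                      # one update per large chunk
                for p, g in zip(self.mem.parameters(), grads):
                    p -= self.lr * g
        return torch.cat(outs)
```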

Raphaël Millière (@raphaelmilliere)'s Twitter Profile Photo

Transformer-based neural networks achieve impressive performance on coding, math & reasoning tasks that require keeping track of variables and their values. But how can they do that without explicit memory? 📄 Our new ICML paper investigates this in a synthetic setting! 🧵 1/13

Jon Richens (@jonathanrichens)'s Twitter Profile Photo

Are world models necessary to achieve human-level agents, or is there a model-free short-cut?
Our new #ICML2025 paper tackles this question from first principles, and finds a surprising answer: agents _are_ world models… 🧵

Jingyun Yang (@yjy0625)'s Twitter Profile Photo

Introducing Mobi-π: Mobilizing Your Robot Learning Policy. Our method: ✈️ enables flexible mobile skill chaining 🪶 without requiring additional policy training data 🏠 while scaling to unseen scenes 🧵↓

Seohong Park (@seohong_park)'s Twitter Profile Photo

Is RL really scalable like other objectives? We found that just scaling up data and compute is *not* enough to enable RL to solve complex tasks. The culprit is the horizon. Paper: arxiv.org/abs/2506.04168 Thread ↓
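
The tweet doesn't state the formal argument, but a standard illustration of why horizon, rather than data or compute alone, becomes the bottleneck (an assumption here, not necessarily the paper's analysis) is error compounding:

```latex
% If the policy errs with probability \epsilon at each of H steps,
% per-step successes compound multiplicatively:
\Pr[\text{success}] = (1-\epsilon)^{H} \approx e^{-\epsilon H}
% so success decays exponentially in the horizon H unless extra data and
% compute drive the per-step error down like \epsilon = O(1/H).
```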

Jiaheng Hu (@jiahenghu1)'s Twitter Profile Photo

Real-world RL, where robots learn directly from physical interactions, is extremely challenging – especially for high-DoF systems like mobile manipulators. 1️⃣ Long-horizon tasks and large action spaces lead to difficult policy optimization. 2️⃣ Real-world exploration with

Yuchen Zhang (@yuchenzhan54250)'s Twitter Profile Photo

Introducing UFM, a Unified Flow & Matching model, which for the first time shows that unifying the optical flow and image matching tasks is mutually beneficial and achieves SOTA.

Check out UFM's matching in action below! 👇

🌐 Website: uniflowmatch.github.io
🧵👇

Tom Silver (@tomssilver)'s Twitter Profile Photo

This week's #PaperILike is "Long-Horizon Multi-Robot Rearrangement Planning for Construction Assembly" (Hartmann et al., TRO 2022). Take two minutes to watch this video: youtube.com/watch?v=Gqhouv… I don't use a lot of emojis, but 🤯 PDF: arxiv.org/abs/2106.02489