An-Chieh Cheng (@anjjei) 's Twitter Profile
An-Chieh Cheng

@anjjei

PhD student @UCSanDiego; prev intern: @AdobeResearch; I love 3D vision.

ID: 203924729

Website: http://anjiecheng.me · Joined: 17-10-2010 14:20:00

226 Tweets

527 Followers

355 Following

Dorsa Sadigh (@dorsasadigh) 's Twitter Profile Photo

Generalist robot policies should be evaluated on 'generalization' metrics rather than merely reporting success rates in arbitrary scenarios. ★-Gen introduces axes of generalization across the inputs and outputs of policies and guides evaluation benchmarks for robot policies. 1/7

Roger Qiu (@rogerqiu_42) 's Twitter Profile Photo

Feature Splatting can now turn Objaverse assets into Gaussian Splats (GS). With an optimized kernel, 30K iterations can be done in <1 min on a single 4090 GPU.

Yufei Ye (@yufei_ye) 's Twitter Profile Photo

We will be hosting the workshop on "Agents in Interactions, from Humans to Robots" at CVPR2025 #CVPR2025 , welcome to join us by submitting a paper or stopping by our talks/posters! For more info please check out: agents-in-interactions.github.io

Erik Daxberger (@edaxberger) 's Twitter Profile Photo

Check out our new work on exploring 3D Spatial Understanding with Multimodal LLMs!🚀 📀CA-VQA: A fine-tuning dataset and benchmark w/ various input signals and spatial tasks. 🤖MM-Spatial: A generalist MLLM excelling at spatial reasoning. 🔗arxiv.org/abs/2503.13111 🧵(1/n)

Roger Qiu (@rogerqiu_42) 's Twitter Profile Photo

Diverse training data leads to a more robust humanoid manipulation policy, but collecting robot demonstrations is slow. Introducing our latest work, Humanoid Policy ~ Human Policy. We advocate human data as a scalable data source for co-training egocentric manipulation policies.⬇️

Xueyan Zou (@xyz2maureen) 's Twitter Profile Photo

[1/n] We are releasing M3 (#ICLR2025): a Gaussian Splatting method that builds LMM memories for arbitrary scenes. 🔥 [Efficient] 16 degrees in each Gaussian primitive for one LMM. 🔥 [Alignment] The rendered features are directly in the source LMM embedding space.
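As a rough sketch of the idea only (this is not the paper's implementation, and all names below are hypothetical): a per-Gaussian feature vector can be alpha-composited along a ray the same way color is, so the rendered per-pixel features stay in the same embedding space as the features attached to the primitives.

```python
import numpy as np

def composite_features(features, alphas):
    """Front-to-back alpha compositing of per-Gaussian feature vectors
    along one ray. `features` is (N, D), sorted near-to-far; `alphas`
    is (N,) per-Gaussian opacity after projection. The result lives in
    the same D-dimensional space as the input features."""
    out = np.zeros(features.shape[1])
    transmittance = 1.0
    for f, a in zip(features, alphas):
        out += transmittance * a * f
        transmittance *= (1.0 - a)
    return out

# Two Gaussians with 16-dim features (matching the "16 degrees in each
# Gaussian primitive" in the tweet):
feats = np.stack([np.ones(16), 2 * np.ones(16)])
alphas = np.array([0.5, 1.0])
rendered = composite_features(feats, alphas)
# each dim: 0.5*1 + (1-0.5)*1.0*2 = 1.5
```

Because compositing is a convex-like weighted sum of the per-primitive features, a rendered pixel feature remains directly comparable to embeddings from the source LMM, which is the alignment property the tweet highlights.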

Chan Hee (Luke) Song (@luke_ch_song) 's Twitter Profile Photo

🔥 VLMs aren’t built for spatial reasoning — yet. They hallucinate free space. Misjudge object fit. Can’t tell below from behind. We built RoboSpatial to tackle that — a dataset for teaching spatial understanding to 2D/3D VLMs for robotics. 📝 Perfect review scores #CVPR2025

Jiarui Xu (@jerry_xu_jiarui) 's Twitter Profile Photo

Test-Time Training (TTT) is now available in video generation! We can directly generate complete one-minute videos with great temporal and spatial coherence. We created more episodes of Tom and Jerry (my favorite childhood cartoon) with our model. test-time-training.github.io/video-dit/
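The general TTT recipe, independent of the video model: at inference time, take a few gradient steps on a self-supervised objective computed from the test input itself before producing the output. The toy below illustrates only that recipe; the quadratic reconstruction loss and scalar weight are stand-ins, not the actual video architecture.

```python
import numpy as np

def ttt_step(w, x, lr=0.1):
    """One test-time-training step: adapt weight w on a self-supervised
    reconstruction objective built from the test input x alone.
    Toy loss: mean((w*x - x)^2), whose gradient in w is mean(2*(w*x - x)*x)."""
    grad = (2.0 * (w * x - x) * x).mean()
    return w - lr * grad

# Adapt from a cold start on a single "test sequence" x:
w = 0.0
x = np.array([1.0, 2.0])
for _ in range(100):
    w = ttt_step(w, x)
# w converges toward 1.0, the weight that perfectly reconstructs x
```

The point of the recipe is that the adaptation signal requires no labels: the model improves itself on each new test sequence using only that sequence.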

Hongxu (Danny) Yin (@yin_hongxu) 's Twitter Profile Photo

#RSS2025 NaVILA is a successful attempt at using VILA to drive real-world robot dogs and humanoids! Fully deployable. Cost-saving. Fast inference. Check out our project page: navila-bot.github.io Many more amazing things to come!

Isabella Liu (@isabella__liu) 's Twitter Profile Photo

Excited to be at #ICLR2025 in person this year! Looking forward to reconnecting and making new friends.🤩 Come chat with us about Dynamic Gaussians Mesh at poster #97 tomorrow (4/26, 3–5:30pm). See you there!🥳 Website: liuisabella.com/DG-Mesh

Andrew Liao (@andrewliao11) 's Twitter Profile Photo

Takeaway: Structured, reflective reasoning can be taught — even in perception. We show that generating better data can unlock stronger visual reasoning. 🌐Website: andrewliao11.github.io/LongPerceptual… 🤗Dataset: huggingface.co/datasets/andre… 📜Paper: arxiv.org/abs/2504.15362

Manling Li (@manlingli_) 's Twitter Profile Photo

🚨CVPR Workshop on Foundation Models + Embodied Agents Extending non-archival submission deadline to be after NeurIPS, May 17th! 🌐Website: …models-meet-embodied-agents.github.io/cvpr2025/ 📜OpenReview: openreview.net/group?id=thecv… 👥Program committee sign up form forms.gle/bL17vmr7ZbybxE… ✉️mailing list:

Hanwen Jiang (@hanwenjiang1) 's Twitter Profile Photo

Supervised learning has held 3D Vision back for too long. Meet RayZer — a self-supervised 3D model trained with zero 3D labels: ❌ No supervision of cameras & geometry ✅ Just RGB images And the wild part? RayZer outperforms supervised methods (as 3D labels from COLMAP are noisy)
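The supervision signal in a fully self-supervised 3D pipeline reduces to the pixels themselves: render a held-out view and compare it to the real photo. The sketch below shows only that generic photometric objective; the function name and setup are illustrative, not RayZer's actual code.

```python
import numpy as np

def photometric_loss(rendered, target):
    """MSE between a rendered RGB view and the held-out ground-truth
    photo. No camera poses, no depth, no COLMAP pseudo-labels are
    needed -- which also means no noisy COLMAP labels to inherit."""
    return float(np.mean((rendered - target) ** 2))

rng = np.random.default_rng(0)
target = rng.random((8, 8, 3))   # a held-out "photo"
perfect = target.copy()          # an ideal render
offset = target + 0.1            # a render that is uniformly 0.1 off
```

A perfect render scores zero, and any deviation from the real image is penalized directly, so the model is graded against ground truth rather than against reconstructed labels.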

Xuxin Cheng (@xuxin_cheng) 's Twitter Profile Photo

Meet 𝐀𝐌𝐎 — our universal whole‑body controller that unleashes the 𝐟𝐮𝐥𝐥 kinematic workspace of humanoid robots in the physical world. AMO is a single policy trained with RL + Hybrid Mocap & Trajectory‑Opt. Accepted to #RSS2025. Try our open models & more 👉

yisha (@yswhynot) 's Twitter Profile Photo

For years, I’ve been tuning parameters for robot designs and controllers on specific tasks. Now we can automate this at dataset scale. Introducing Co-Design of Soft Gripper with Neural Physics — a soft gripper trained in simulation to deform while handling load.

Xiaolong Wang (@xiaolonw) 's Twitter Profile Photo

We have been focusing on policy learning for robotics for a while. But can hardware be learned as well? Check out yisha's recent co-design work that learns what a soft gripper should be if we want to do better manipulation.