Yufei Ye (@yufei_ye) Twitter Tweets • TwiCopy

Michael Black

@michael_j_black

2 years ago

Check out Silvia Zuffi’s latest work on 3D animals, which connects language with shape. arxiv.org/pdf/2404.03042…

thumb_up_off_alt74

chat_bubble_outline2

repeat10

shareShare

🔔Can VLMs spatially refer objects in Ego? Can VLMs understand interactions? Which hand is holding an object and what's the object in the left hand? We show current VLMs struggle in interactions and release new data & models for HOI-Ref in Ego sid2697.github.io/hoi-ref/ On ArXiv🧵

thumb_up_off_alt53

chat_bubble_outline2

repeat11

shareShare

Francis Engelmann

@francisengelman

2 years ago

🚀Next step in 3D scene understanding beyond objects and parts, we now explore interactive elements and functionalities👇 Check out our #CVPR2025 challenge and win2️⃣RTX4090🏅 (sponsored by Matterport🙏) 🛠️Details: opensun3d.github.io/cvpr24-challen… 📄Paper: scenefun3d.github.io #CVPR2024

thumb_up_off_alt236

chat_bubble_outline3

repeat47

shareShare

John Carmack

@id_aa_carmack

2 years ago

Have CG art tools gotten to the point of modeling all the muscles in a human realistically, so a lot of the skinning and animation would be parametric variations on “baseline human” rather than free form constructions? I have occasionally advocated for that — some would complain

thumb_up_off_alt905

chat_bubble_outline57

repeat56

shareShare

Michael Black

@michael_j_black

2 years ago

John Carmack I used to think that controlling movement through muscles was too complex but this ICLR paper changed my mind. But realism also requires modeling the subcutaneous and visceral adipose tissue (ie fat) and it's complex: more on that at #CVPR2024. github.com/martius-lab/de…

thumb_up_off_alt47

chat_bubble_outline3

repeat5

shareShare

Yufei Ye

@yufei_ye

a year ago

🙌🏻🙌🏻🙌🏻

thumb_up_off_alt2

chat_bubble_outline0

repeat0

shareShare

Karl Pertsch

@karlpertsch

a year ago

Very excited to release OpenVLA today, a 7B parameter open-source vision-language-action model (VLA). 🦾 SoTA generalist policy (better than Octo & RT-2-X) ⚡️ Easy to run & fine-tune on 1 GPU with quantization and LoRA 💻 Open-source PyTorch codebase 🤗 Models on HuggingFace 1/

thumb_up_off_alt390

chat_bubble_outline4

repeat63

shareShare

Marilyn Keller

@marilyn59846278

a year ago

Wondering what people look like below the skin and fat? 💪🩻💀We introduce HIT: Human Implicit Tissues t.ly/PNJcF at #CVPR 🧵(1/7) 🚨I'm graduating soon and looking for opportunities, so let’s meet at #CVPR2024 📅 Wednesday, June 19 🕥 10:30 - 12:00 📍 Poster 4978

thumb_up_off_alt123

chat_bubble_outline4

repeat34

shareShare

Yufei Ye

@yufei_ye

a year ago

Come and check out our workshop MANGO @ summit 430. Happening now~ New Trends in Multimodal Human Action Perception, Understanding and Generation 🥭🥭🥭 mango-workshop.github.io/2024.html

thumb_up_off_alt21

chat_bubble_outline0

repeat1

shareShare

Yuliang Xiu

@yuliangxiu

a year ago

Code of PuzzleAvatar (#SIGGRAPHAsia2024) gets released. 3D Human reconstruction from unconstrained photo collections (your album), in ANY poses, from ANY views, with ANY cropping or occlusion. NO estimation of body pose or camera is required. github.com/YuliangXiu/Puz…

thumb_up_off_alt212

chat_bubble_outline3

repeat38

shareShare

Homanga Bharadhwaj

@mangahomanga

a year ago

Gen2Act: Casting language-conditioned manipulation as *human video generation* followed by *closed-loop policy execution conditioned on the generated video* enables solving diverse real-world tasks unseen in the robot dataset! homangab.github.io/gen2act/ 1/n

thumb_up_off_alt220

chat_bubble_outline7

repeat53

shareShare

Sudeep Dasari

@sudeepdasari

a year ago

Excited to share my final PhD project😀 We show how simple, yet elegant changes enable diffusion transformers to learn SOTA robotic policies on real robots. Our method improves performance by 20% across a wide range of highly dexterous tasks - like cutting sushi! 1/n

thumb_up_off_alt157

chat_bubble_outline3

repeat25

shareShare

Haochen Shi

@haochenshi74

10 months ago

Time to democratize humanoid robots! Introducing ToddlerBot, a low-cost ($6K), open-source humanoid for robotics and AI research. Watch two ToddlerBots seamlessly chain their loco-manipulation skills to collaborate in tidying up after a toy session. toddlerbot.github.io

thumb_up_off_alt559

chat_bubble_outline28

repeat107

shareShare

Yufei Ye

@yufei_ye

9 months ago

We will be hosting the workshop on "Agents in Interactions, from Humans to Robots" at CVPR2025 #CVPR2025 , welcome to join us by submitting a paper or stopping by our talks/posters! For more info please check out: agents-in-interactions.github.io

We will be hosting the workshop on "Agents in Interactions, from Humans to Robots" at CVPR2025
<a href="/CVPR/">#CVPR2025</a>
, welcome to join us by submitting a paper or stopping by our talks/posters!

For more info please check out:
agents-in-interactions.github.io

thumb_up_off_alt88

chat_bubble_outline2

repeat13

shareShare

Guanya Shi

@guanyashi

9 months ago

Super impressive! A perfect example of how RL can accelerate robotics research, especially when with good hardware & infra: Previous Atlas demos (using TO + MPC) take years, now new Atlas RL demos only take months! Let me try to reverse-engineer the pipeline: 1. Some kind of

thumb_up_off_alt213

chat_bubble_outline5

repeat24

shareShare

Homanga Bharadhwaj

@mangahomanga

9 months ago

We're organizing a workshop on human motion modeling and learning robotic control by observing humans! Come join us #CVPR2025 in Nashville! We have an exciting group of speakers and are also inviting paper submissions with a May 7 deadline 1/n

We're organizing a workshop on human motion modeling and learning robotic control by observing humans!

Come join us <a href="/CVPR/">#CVPR2025</a> in Nashville!

We have an exciting group of speakers and are also inviting paper submissions with a May 7 deadline

1/n

thumb_up_off_alt52

chat_bubble_outline2

repeat12

shareShare

Tarasha Khurana

@tarashakhurana

9 months ago

Reminder that the deadline to submit your cool work to the Workshop on 4D vision @ CVPR 2025 is in about a week!!!

thumb_up_off_alt13

chat_bubble_outline0

repeat2

shareShare

Dandan Shan

@dandanshan_

8 months ago

📢We are organizing the 1st workshop on "Agents in Interactions: from Humans👤to Robots🤖" at #CVPR2025 #CVPR2025! Welcome to join us by submitting your paper or hearing insightful talks from our excellent speakers!! Details: agents-in-interactions.github.io

📢We are organizing the 1st workshop on "Agents in Interactions: from Humans👤to Robots🤖" at #CVPR2025 <a href="/CVPR/">#CVPR2025</a>!

Welcome to join us by submitting your paper or hearing insightful talks from our excellent speakers!!

Details: agents-in-interactions.github.io

thumb_up_off_alt99

chat_bubble_outline1

repeat21

shareShare

Junyi Zhang

@junyi42

8 months ago

Introducing St4RTrack!🖖 Simultaneous 4D Reconstruction and Tracking in the world coordinate feed-forwardly, just by changing the meaning of two pointmaps! st4rtrack.github.io

thumb_up_off_alt267

chat_bubble_outline6

repeat51

shareShare

Yufei Ye

@yufei_ye

7 months ago

Human HOI data from Web helps functional grasp~

thumb_up_off_alt15

chat_bubble_outline0

repeat0

shareShare

Yufei Ye

Michael Black

Dima Damen

Francis Engelmann

John Carmack

Michael Black

Yufei Ye

Karl Pertsch

Marilyn Keller

Yufei Ye

Yuliang Xiu

Homanga Bharadhwaj

Sudeep Dasari

Haochen Shi

Yufei Ye

Guanya Shi

Homanga Bharadhwaj

Tarasha Khurana

Dandan Shan

Junyi Zhang

Yufei Ye