Zubair Irshad (@mzubairirshad) 's Twitter Profile
Zubair Irshad

@mzubairirshad

Research Scientist @ToyotaResearch | PhD in AI and DL @GeorgiaTech | Researching Large Behavioral Models | 3D Vision | Robotics

ID: 2547626083

linkhttps://zubairirshad.com/ calendar_today05-06-2014 07:38:08

301 Tweet

1,1K Takipçi

1,1K Takip Edilen

Zhenjun Zhao (@zhenjun_zhao) 's Twitter Profile Photo

FastMap: Revisiting Dense and Scalable Structure from Motion Jiahao Li, Haochen Wang, Zubair Irshad, Igor Vasiljevic, Matthew R. Walter, Vitor Campagnolo Guizilini, Greg Shakhnarovich tl;dr: replace BA with epipolar error+IRLS; fully PyTorch implementation arxiv.org/abs/2505.04612

FastMap: Revisiting Dense and Scalable Structure from Motion

Jiahao Li, <a href="/__whc__/">Haochen Wang</a>, <a href="/mzubairirshad/">Zubair Irshad</a>, <a href="/vslevic/">Igor Vasiljevic</a>, Matthew R. Walter, Vitor Campagnolo Guizilini, <a href="/gregshakh/">Greg Shakhnarovich</a>

tl;dr: replace BA with epipolar error+IRLS; fully PyTorch implementation

arxiv.org/abs/2505.04612
MrNeRF (@janusch_patas) 's Twitter Profile Photo

FastMap: Revisiting Dense and Scalable Structure from Motion "FASTMAP, a redesigned SfM framework, achieves fast, high-accuracy dense structure from motion. On large scenes with thousands of images, FASTMAP is up to one to two orders of magnitude faster than GLOMAP and COLMAP.

Alexandre Morgand (@almorgand) 's Twitter Profile Photo

FastMap: Revisiting Dense and Scalable Structure from Motion TL;DR: 2 orders of magnitude faster than GLOMAP; many GPU implementations; linear complexity for optimisation; comparable accuracy

Max Fu (@letian_fu) 's Twitter Profile Photo

Tired of teleoperating your robots? We built a way to scale robot datasets without teleop, dynamic simulation, or even robot hardware. Just one smartphone scan + one human hand demo video → thousands of diverse robot trajectories. Trainable by diffusion policy and VLA models

Zubair Irshad (@mzubairirshad) 's Twitter Profile Photo

Interested in collecting robot training data without robots in the loop? 🦾 Check out this cool new approach that uses a single mobile device scan and a human demo video to generate diverse data for training diffusion and VLA manipulation policies. 🚀 Great work by Max Fu

Fang Jiading (@jiading_fang) 's Twitter Profile Photo

Ever want to reconstruct and animate everyday articulated objects with no 3D scans or category priors? 🚀Introducing SplArt: Articulation Estimation & Part-Level Reconstruction with 3D Gaussian Splatting! #3Dvision #GaussianSplatting

Shun Iwase (@s1wase) 's Twitter Profile Photo

#CVPR2025 starts in two days, and can’t wait to share our new work! 🎉 We present ZeroGrasp, a unified framework for 3D reconstruction and grasp prediction that generalizes to unseen objects. Paper📄: arxiv.org/abs/2504.10857 Webpage🌐:sh8.io/#/zerograsp (1/4 🧵)

Katherine Liu (@robo_kat) 's Twitter Profile Photo

How can we achieve both common sense understanding that can deal with varying levels of ambiguity in language and dextrous manipulation? Check out CodeDiffuser, a really neat work that bridges Code Gen with a 3D Diffusion Policy! This was a fun project with cool experiments! 🤖

Karl Pertsch (@karlpertsch) 's Twitter Profile Photo

We’re releasing the RoboArena today!🤖🦾 Fair & scalable evaluation is a major bottleneck for research on generalist policies. We’re hoping that RoboArena can help! We provide data, model code & sim evals for debugging! Submit your policies today and join the leaderboard! :) 🧵

Kushal (@kushalk_) 's Twitter Profile Photo

Teleoperation is slow, expensive, and difficult to scale. So how can we train our robots instead? Introducing X-Sim: a real-to-sim-to-real framework that trains image-based policies 1) learned entirely in simulation 2) using rewards from human videos. portal-cornell.github.io/X-Sim

Zubair Irshad (@mzubairirshad) 's Twitter Profile Photo

Great point! I think real2sim is a lucrative way to solve it — potential to scale better but many challenges still remain in getting it to work reliably in the real-world i.e. getting good sim vs real correlations or guarantees. Bonus point: Saves painstaking real-world evals in