Zubair Irshad (@mzubairirshad) Twitter Tweets • TwiCopy

Zubair Irshad

@mzubairirshad


Research Scientist @ToyotaResearch | PhD in AI and DL @GeorgiaTech | Researching Large Behavioral Models | 3D Vision | Robotics

ID: 2547626083

Link: https://zubairirshad.com/

Joined: 05-06-2014 07:38:08

301 Tweets

1.1K Followers

1.1K Following

Zhenjun Zhao

@zhenjun_zhao

7 months ago

FastMap: Revisiting Dense and Scalable Structure from Motion

Jiahao Li, Haochen Wang, Zubair Irshad, Igor Vasiljevic, Matthew R. Walter, Vitor Campagnolo Guizilini, Greg Shakhnarovich

tl;dr: replace BA with epipolar error + IRLS; fully PyTorch implementation

arxiv.org/abs/2505.04612

98 likes, 1 reply, 21 retweets

MrNeRF

@janusch_patas

7 months ago

FastMap: Revisiting Dense and Scalable Structure from Motion

"FASTMAP, a redesigned SfM framework, achieves fast, high-accuracy dense structure from motion. On large scenes with thousands of images, FASTMAP is up to one to two orders of magnitude faster than GLOMAP and COLMAP."

287 likes, 4 replies, 46 retweets

Alexandre Morgand

@almorgand

7 months ago

FastMap: Revisiting Dense and Scalable Structure from Motion TL;DR: 2 orders of magnitude faster than GLOMAP; many GPU implementations; linear complexity for optimisation; comparable accuracy

150 likes, 2 replies, 19 retweets

Max Fu

@letian_fu

7 months ago

Tired of teleoperating your robots? We built a way to scale robot datasets without teleop, dynamic simulation, or even robot hardware. Just one smartphone scan + one human hand demo video → thousands of diverse robot trajectories. Trainable by diffusion policy and VLA models

407 likes, 21 replies, 77 retweets

Zubair Irshad

@mzubairirshad

7 months ago

Interested in collecting robot training data without robots in the loop? 🦾 Check out this cool new approach that uses a single mobile device scan and a human demo video to generate diverse data for training diffusion and VLA manipulation policies. 🚀 Great work by Max Fu

14 likes, 0 replies, 3 retweets

Fang Jiading

@jiading_fang

6 months ago

Ever want to reconstruct and animate everyday articulated objects with no 3D scans or category priors? 🚀Introducing SplArt: Articulation Estimation & Part-Level Reconstruction with 3D Gaussian Splatting! #3Dvision #GaussianSplatting

5 likes, 1 reply, 2 retweets

Zsolt Kira

@zsoltkira

6 months ago

#CVPR2025 next week will be an exciting one! Check out our work below on VLMs, VLAs, and 3D for robotics (including the first 3D VLMs for Robotics workshop)!

Georgia Tech School of Interactive Computing, Machine Learning at Georgia Tech

20 likes, 1 reply, 4 retweets

Shun Iwase

@s1wase

6 months ago

#CVPR2025 starts in two days, and can’t wait to share our new work! 🎉 We present ZeroGrasp, a unified framework for 3D reconstruction and grasp prediction that generalizes to unseen objects. Paper 📄: arxiv.org/abs/2504.10857 Webpage 🌐: sh8.io/#/zerograsp (1/4 🧵)

53 likes, 2 replies, 13 retweets

1X

@1x_tech

6 months ago

1X World Model Scaling Evaluation for Robots

801 likes, 31 replies, 104 retweets

Katherine Liu

@robo_kat

6 months ago

How can we achieve both common sense understanding that can deal with varying levels of ambiguity in language and dextrous manipulation? Check out CodeDiffuser, a really neat work that bridges Code Gen with a 3D Diffusion Policy! This was a fun project with cool experiments! 🤖

12 likes, 1 reply, 5 retweets

Karl Pertsch

@karlpertsch

6 months ago

We’re releasing the RoboArena today!🤖🦾 Fair & scalable evaluation is a major bottleneck for research on generalist policies. We’re hoping that RoboArena can help! We provide data, model code & sim evals for debugging! Submit your policies today and join the leaderboard! :) 🧵

399 likes, 10 replies, 78 retweets

Kushal

@kushalk_

5 months ago

Teleoperation is slow, expensive, and difficult to scale. So how can we train our robots instead? Introducing X-Sim: a real-to-sim-to-real framework that trains image-based policies 1) learned entirely in simulation 2) using rewards from human videos. portal-cornell.github.io/X-Sim

111 likes, 4 replies, 39 retweets

Zubair Irshad

@mzubairirshad

4 months ago

Great point! I think real2sim is a lucrative way to solve it — potential to scale better but many challenges still remain in getting it to work reliably in the real-world i.e. getting good sim vs real correlations or guarantees. Bonus point: Saves painstaking real-world evals in

7 likes, 0 replies, 0 retweets