Sanket Kalwar (@kalwarsanket) 's Twitter Profile
Sanket Kalwar

@kalwarsanket

Researcher@RRC,IIIT-H

ID: 1189958493979066368

linkhttp://sanketkalwar.github.io calendar_today31-10-2019 17:33:48

221 Tweet

83 Followers

1,1K Following

Rohit Jayanti (@_rjayanti) 's Twitter Profile Photo

1/8 Meta's SAM2 has some really fun capabilities. Coolest one I found ? Tracking through a pretty wacky transition. All this with just a single point prompt on frame 0. Frame-by-frame breakdown and more examples in the 🧵 below!

Rohit Jayanti (@_rjayanti) 's Twitter Profile Photo

7/8 Finally, had to try Transformers (wink-wink)! Several point prompts but all before the transition through the underpass! Roughly - Point prompts for Bumblebee on frame 050, on the Decepticon on frame 089 (before it disintegrates!), a couple on the bridge on frame 175.

Rohit Jayanti (@_rjayanti) 's Twitter Profile Photo

8/8 Credits - Used Meta's (surprisingly snappy) web demo. Video sources below. [YT] - youtu.be/_57Opqw6Kkc?t=… - youtube.com/watch?v=udKE1k… - youtu.be/_Ufcn3YC-Js?si… [Pexels] - pexels.com/video/person-v… - pexels.com/video/travelin… - pexels.com/video/view-of-…

Nikhil Keetha (@nik__v__) 's Twitter Profile Photo

This is very cool! TLDR; DUSt3R predicting pointmaps in global coordinate frame (eliminating need for BA). A very neat way and implicitly SLAMy. hengyiwang.github.io/projects/spann…

Robotics Lab (@rrclab_iiith) 's Twitter Profile Photo

RRC paper "Open-Set 3D Semantic Instance Maps for Vision Language Navigation (O3D-SIM)" has been accepted in Advanced Robotics! journal Work led by Laksh Nanwani, Kumaraditya Gupta, Aditya Mathur, & Swayam Agrawal, under guidance of Prof. Madhava Krishna & Dr. A. H. Abdul Hafez.

Shubodh Sai (@shubodhs_ai) 's Twitter Profile Photo

Places are composed of things. Recognizing & retrieving these things instead of the whole image enables 🧭viewpoint invariance 🖼️semantic interpretability 🔮open-set recognition 🧵on our #ECCV2024 paper: Revisit Anything: Visual Place Recognition via Image Segment Retrieval 👇

Sourav Garg (@sourav_garg_) 's Twitter Profile Photo

Here’s our new #VPR take on the good old ‘revisiting problem’ in #SLAM / #localization #ECCV2024 European Conference on Computer Vision #ECCV2026 Incredible effort by Shubodh Sai and Kartik Garg Catch Shubodh Sai in person at the venue ILR Workshop: 30 Sep, 10-11am CEST Main Conf: 3 Oct, 10:30am-12:30pm CEST

Sourav Garg (@sourav_garg_) 's Twitter Profile Photo

Quick #success story of #PhD #anxiety test: ✅get paper accepted #ECCV2024 ✅get visa approval on Day 2 of European Conference on Computer Vision #ECCV2026 ✅✈️ UK to Italy on Day 4 (presentation) ✅collect poster minutes before the presentation slot ✅ready to answer tough Qs Catch ahmad khaliq Oct 2, 16:30

Quick #success story of #PhD #anxiety test: 
✅get paper accepted #ECCV2024
✅get visa approval on Day 2 of <a href="/eccvconf/">European Conference on Computer Vision #ECCV2026</a> 
✅✈️ UK to Italy on Day 4 (presentation)
✅collect poster minutes before the presentation slot
✅ready to answer tough Qs
Catch <a href="/imahmadkhaliq/">ahmad khaliq</a> Oct 2, 16:30
Ravi Kiran S (@vikataravi) 's Twitter Profile Photo

It's time for multilingual Indic Vision Language Models ! Happy to present evikrAI, an Indic-flavored technology to assist e-commerce applications. x.com/BharatGen_tech…

Bipasha Sen (@bipashasen31) 's Twitter Profile Photo

🚀 Introducing Frank 🤖—a whole-body robot control system for day-to-day household chores. Frank has been in works for the past year—a tightly coupled hardware and a remote teleoperation interface co-led by Michelle, Nandini, and Pulkit Agrawal. Thanks to Aditya Agarwal and

Aditya Agarwal (@skymanaditya1) 's Twitter Profile Photo

Had a lot of fun recording these videos! 🥳 Hands down the best teleoperation system that I have tried out till date! 🫡

Utkarsh Mishra (@utkarshm0410) 's Twitter Profile Photo

How can robots compositionally generalize over multi-object multi-robot tasks for long-horizon planning? At #CoRL2024, we introduce Generative Factor Chaining (GFC), a diffusion-based approach that composes spatial-temporal factors into long-horizon skill plans. (1/7)

Danfei Xu (@danfei_xu) 's Twitter Profile Photo

We figured out a way to solve long-horizon planning problem by composing a bunch of modular diffusion models in a factor graph! This allows us to reuse the diffusion models in unseen new tasks and achieve zero-shot generalization to multi-robot collaborative manipulation tasks.

Dhruv Patel (@dhruvpatel2012) 's Twitter Profile Photo

Presenting EgoMimic at #CoRL2024! 🎉 Effortless data collection with Project Aria @Meta glasses—just wear & go. Our low-cost manipulator leverages this scalable data to perform grocery handling, laundry, coffee-making & more. Thrilled to be a part of this effort! egomimic.github.io

Danfei Xu (@danfei_xu) 's Twitter Profile Photo

We started this moonshot project a year ago. Now we are excited to share our progress on robot learning from egocentric human data 🕶️🤲 Key idea: Egocentric human data is robot data in disguise. By bridging the kinematic, visual, and distributional gap, we can directly leverage

Pulkit Agrawal (@pulkitology) 's Twitter Profile Photo

Introducing DART: breaking the barriers for robotic data collection by enabling anyone, anywhere in the world to control robots without even having a robot. Just log into dexhub.ai to contribute and control robots at much lower fatigue and higher speed than

Aditya Agarwal (@skymanaditya1) 's Twitter Profile Photo

🚀Excited to share SceneComplete: an open-world 3D scene completion system for constructing a complete, segmented 3D model of a scene from a single RGB-D image.🖼️🤖 SceneComplete enables dexterous grasping and robust robot manipulation in highly cluttered scenes - a short 🧵

IIIT Hyderabad (@iiit_hyderabad) 's Twitter Profile Photo

At the recently concluded International Conference on Intelligent Robots and Systems (IROS 2024) that was held in Abu Dhabi, IIITH’s Robotics Research Centre made a splash with 6 research papers. Here’s a brief roundup of some of that cutting-edge work. blogs.iiit.ac.in/iros-2024/

Dhruv Patel (@dhruvpatel2012) 's Twitter Profile Photo

Thanks for featuring our work Reality Labs at Meta Project Aria @Meta AI at Meta. We show that embodied human data is a scalable data source for robot learning. Check out the full Meta AI blog post covering our story -ai.meta.com/blog/egomimic-…