Sabera Talukder (@saberatalukder) 's Twitter Profile
Sabera Talukder

@saberatalukder

🚀🚀🚀

@Caltech, @GoogleAI, @Stanford

ID: 998026570697617408

linkhttp://www.sabera.ai calendar_today20-05-2018 02:24:11

1,1K Tweet

3,3K Followers

31 Following

Geeling Chau (@geelingc) 's Twitter Profile Photo

I will touch on some work we've (Christopher Wang, Sabera Talukder, Vighnesh Subramaniam, Saraswati, Yisong Yue, Boris Katz, Andrei Barbu) done on pretraining transformers across variable input layouts for 🔥 decoding and interpretability. Read more here: arxiv.org/abs/2406.03044

Sabera Talukder (@saberatalukder) 's Twitter Profile Photo

Late last year TOTEM was: 1⃣ Accepted to TMLR 🥳 openreview.net/pdf?id=QlTLkH6… 2⃣ Crossed 100 ⭐️s on Github github.com/SaberaTalukder… 3⃣ Released a video summary (🙏 Steven Brunton) youtube.com/watch?v=OqrCpd… Glad you're feeling the tokens! More coming soon 😉

Sabera Talukder (@saberatalukder) 's Twitter Profile Photo

Proud of Geeling Chau leading this new work on neural time series modeling, which just received Oral (top 1.8%) ICLR 2026 2025! Always a delight to support her persistence! Arxiv version: arxiv.org/pdf/2406.03044

Damiano Marsili (@marsilidamiano) 's Twitter Profile Photo

1/5 LLMs are notoriously not great at 3D perception … but what if there were an alternative path to scaling 3D VQA? 👀 Introducing VADAR: a new program synthesis approach for 3D spatial reasoning! glab-caltech.github.io/vadar/

Aadarsh Sahoo (@sahooaadarsh) 's Twitter Profile Photo

Introducing Kyvo! 🚀 – a decoder-only LLM that aligns text, images & structured 3D scenes token-by-token. From a single image, it reconstructs individual 3D shapes and their locations, renders & edits scenes, answers spatial questions, and more. 💻: glab-caltech.github.io/kyvo/

Yisong Yue (@yisongyue) 's Twitter Profile Photo

Honored to receive the mentoring award from the Grad Student Advisory Board at Caltech EAS. Thanks to the students who nominated me, especially Sabera Talukder!

Honored to receive the mentoring award from the Grad Student Advisory Board at <a href="/Caltech/">Caltech</a> EAS.  Thanks to the students who nominated me, especially <a href="/SaberaTalukder/">Sabera Talukder</a>!
Sabera Talukder (@saberatalukder) 's Twitter Profile Photo

Incredibly well deserved, Yisong Yue is simply the best! Also huge shout out to Hongkai Zheng & Raul Astudillo who also wrote wonderful letters and the rest of the Yue Lab Caltech + friends who endorsed 🦫🧡

Sabera Talukder (@saberatalukder) 's Twitter Profile Photo

Inspired time Laude Institute, huge congrats to Andy Konwinski on the launch! Humbled to be asked to speak on the dos of impact research in front of such 🤩 researchers!! P.S. 👀 the where's waldo moment of my favorite +1 (Yisong Yue) being (I hope 😅) a proud academic dad 🪞📸

Inspired time <a href="/LaudeInstitute/">Laude Institute</a>, huge congrats to <a href="/andykonwinski/">Andy Konwinski</a> on the launch!

Humbled to be asked to speak on the dos of impact research in front of such 🤩 researchers!!

P.S. 👀 the where's waldo moment of my favorite +1 (<a href="/yisongyue/">Yisong Yue</a>) being (I hope 😅) a proud academic dad 🪞📸
Sabera Talukder (@saberatalukder) 's Twitter Profile Photo

Collaboration is important for impactful work, but it is not enough! Impact comes from the collaboration between two types of people. The generators and discriminators (pun very much intended 😉). The generators dream big, swing for the fences, and don’t listen when they are

Albert Li (@albert_h_li) 's Twitter Profile Photo

Super excited about 🥋judo🥋, our new open-source sampling-based MPC toolbox! 🖥️Try in under 30s: pip install judo-rai judo <click link> 🧵thread below👇

Sabera Talukder (@saberatalukder) 's Twitter Profile Photo

I know of folks training on non-specialized hardware and easily clearing 1,000,000 tokens/sec. So while impressive, I'm not convinced that specialized hardware is the way to go if the selling point is simply faster on the tokens/sec measure (they claim 500,000 btw). What is

Sabera Talukder (@saberatalukder) 's Twitter Profile Photo

Settle a recent debate for me! Are constraints (compute, data scale, small teams, etc.) in academia sparking creativity or leading to an increasingly insurmountable knowledge gap?

Sabera Talukder (@saberatalukder) 's Twitter Profile Photo

We really enjoy developing Graphite (we've moved to @graphite), particularly their stacking functionality! It makes a huge difference with larger teams. Bonus points for their support of academics 🫶🫶🫶

Sabera Talukder (@saberatalukder) 's Twitter Profile Photo

You cannot tell someone both what to do and how to do it. People need creativity in at least one layer of this stack to feel motivated.

Sabera Talukder (@saberatalukder) 's Twitter Profile Photo

I love ubiquitous principles that transcend domains. For instance, consistency trumps any other type of forward progress. This is true in athletics, research, entrepreneurship, etc. What are the other ubiquitous principles you've observed (good or bad!)?