Michał Jamry (@jamrymichal) 's Twitter Profile
Michał Jamry

@jamrymichal

ID: 1400537899

calendar_today03-05-2013 19:22:46

227 Tweet

44 Followers

1,1K Following

Suhail (@suhail) 's Twitter Profile Photo

What are some foundational ML/AI books? I'd like to invest in my fundamentals more. Ideally something students might read during their PhD.

Yann LeCun (@ylecun) 's Twitter Profile Photo

We've barely scratched the surface of the space of deep learning architectures. It's a high dimensional space, so the volume is almost entirely contained in the surface. But we've scratched a tiny subset of the surface.

Michael Black (@michael_j_black) 's Twitter Profile Photo

Multi-modal #LLMs understand a lot about humans. But do they understand our 3D pose? We train #PoseGPT to estimate, generate, and reason about 3D human pose (#SMPL) in images and text. This is the first true foundation model for understanding 3D humans. yfeng95.github.io/posegpt/

Aravind Srinivas (@aravsrinivas) 's Twitter Profile Photo

Tomas Mikolov, the OG and inventor of word2vec, gives this thoughts on the test of time award, and the current state of NLP, and chatGPT. 🍿

Tomas Mikolov, the OG and inventor of word2vec, gives this thoughts on the test of time award, and the current state
of NLP, and chatGPT. 🍿
Jascha Sohl-Dickstein (@jaschasd) 's Twitter Profile Photo

Have you ever done a dense grid search over neural network hyperparameters? Like a *really dense* grid search? It looks like this (!!). Blueish colors correspond to hyperparameters for which training converges, redish colors to hyperparameters for which training diverges.

Hieu Pham (@hyhieu226) 's Twitter Profile Photo

research.colfax-intl.com/tutorial-matri… If you learn CUDA, you probably have read the legendary matrix transpose tutorial by Mark Harris. 📚Today, I present to you a CUDA tutorial by friends at Colfax International and myself. We explore many CUDA memory concepts like in Mark's tutorial, and more.

Yann LeCun (@ylecun) 's Twitter Profile Photo

Yet another opportunity to point out that reasoning abilities and common sense should not be confused with an ability to store and approximately retrieve many facts.

Jakob Foerster (@j_foerst) 's Twitter Profile Photo

When I discussed quitting Google to do a Phd, my manager, Steve Cheng, gave me the advice of "6 shots": Doing something meaningful usually takes about 5 years and we are productive for roughly 30 years. That gives you 6 attempts. So pick each one carefully and give it your best.

leloy! (@leloykun) 's Twitter Profile Photo

Deep Learning Optimizers from First Principles Now with more maths! In this thread, I'll discuss: 1. The difference between 1st order gradient dualizaton approaches and 2nd order optimization approaches. 2. Preconditioning--how to do it and why. 3. How to derive a couple of

Deep Learning Optimizers from First Principles

Now with more maths!

In this thread, I'll discuss:

1. The difference between 1st order gradient dualizaton approaches and 2nd order optimization approaches.
2. Preconditioning--how to do it and why.
3. How to derive a couple of
Andrej Karpathy (@karpathy) 's Twitter Profile Photo

The (true) story of development and inspiration behind the "attention" operator, the one in "Attention is All you Need" that introduced the Transformer. From personal email correspondence with the author 🇺🇦 Dzmitry Bahdanau @ NeurIPS ~2 years ago, published here and now (with permission) following

The (true) story of development and inspiration behind the "attention" operator, the one in "Attention is All you Need" that introduced the Transformer. From personal email correspondence with the author <a href="/DBahdanau/">🇺🇦 Dzmitry Bahdanau @ NeurIPS</a> ~2 years ago, published here and now (with permission) following
Zhenjun Zhao (@zhenjun_zhao) 's Twitter Profile Photo

SIU3R: Simultaneous Scene Understanding and 3D Reconstruction Beyond Feature Alignment Qi Xu, Dongxu Wei, Lingzhe Zhao, Wenpu Li (李文朴), Zhangchi Huang, Shunping Ji, Peidong Liu tl;dr: reconstruction and understanding->pixel-aligned 2D-to-3D lifting->unified learnable queries

SIU3R: Simultaneous Scene Understanding and 3D Reconstruction Beyond Feature Alignment

Qi Xu, Dongxu Wei, Lingzhe Zhao, <a href="/pu_wen99907/">Wenpu Li (李文朴)</a>, Zhangchi Huang, Shunping Ji, <a href="/PeidongLiu_/">Peidong Liu</a>

tl;dr: reconstruction and understanding-&gt;pixel-aligned 2D-to-3D lifting-&gt;unified learnable queries