Abhinav Shukla(@Abhinav95_) 's Twitter Profileg
Abhinav Shukla

@Abhinav95_

Researcher at Scaled Foundations. ex: Meta (Reality Labs Research), PhD at Imperial College London, IIIT Hyderabad.

ID:154435289

linkhttps://abhinav95.github.io/ calendar_today11-06-2010 06:35:49

342 Tweets

624 Followers

1,2K Following

Armen Aghajanyan(@ArmenAgha) 's Twitter Profile Photo

It's very easy to make things seem like they work/don't work at smaller scales. Think about how many efficient attention papers were published in the last 5 years. Do we use any of them?

Training at a larger scale has fundamentally different training dynamics/properties than the

account_circle
Yangqing Jia(@jiayq) 's Twitter Profile Photo

I probably have some credibility as a person who has worked on TensorFlow and PyTorch both (also, caffe / ONNX / distbelief / a few others that never saw the light), so here are my two cents:

(1) Speed doesn't really matter today as long as it is not particularly bad.

account_circle
Scaled Foundations(@ScaFoAI) 's Twitter Profile Photo

Welcome David Merrill as an advisor to Scaled Foundations! David is the CEO and co-founder of Elroy Air where he is building large autonomous delivery aircraft systems, and shares our mission to build safe AI-powered robots.

A fellow MIT alum, David is also a TED speaker and an avid

Welcome David @Merrill as an advisor to @ScaFoAI! David is the CEO and co-founder of @elroyair where he is building large autonomous delivery aircraft systems, and shares our mission to build safe AI-powered robots. A fellow MIT alum, David is also a TED speaker and an avid
account_circle
Scaled Foundations(@ScaFoAI) 's Twitter Profile Photo

If you’re at this week, come hear Scaled Foundations’ CTO Sai Vemprala discuss 'The Impact of Generative AI on Robotics'.

Look forward to seeing you there!

Wednesday, 3/20 at 2:00 pm
SJCC 230B (L2)

nvidia.com/gtc/session-ca…

If you’re at #GTC2024 this week, come hear Scaled Foundations’ CTO @saihv discuss 'The Impact of Generative AI on Robotics'. Look forward to seeing you there! Wednesday, 3/20 at 2:00 pm SJCC 230B (L2) nvidia.com/gtc/session-ca… #ai #robotics #generativeai
account_circle
Wenqi Jia(@Wenqi_Jia) 's Twitter Profile Photo

🗣Exploring beyond interpreting actions that directly involve the camera wearer, how can egocentric audio-visual signals aid in understanding the natural social behaviors among all partners, ultimately enhancing our daily communication?
Project: vjwq.github.io/AV-CONV/

🗣Exploring beyond interpreting actions that directly involve the camera wearer, how can egocentric audio-visual signals aid in understanding the natural social behaviors among all partners, ultimately enhancing our daily communication? #CVPR24 Project: vjwq.github.io/AV-CONV/
account_circle
Ashish Kapoor(@akapoor_av8r) 's Twitter Profile Photo

For folks getting excited about 'Next Token Prediction' in robotics, I present 2-year-old work on Perception-Action Causal Transformer (PACT). 🧵

For folks getting excited about 'Next Token Prediction' in robotics, I present 2-year-old work on Perception-Action Causal Transformer (PACT). 🧵
account_circle
Aditya Kusupati(@adityakusupati) 's Twitter Profile Photo

To be clear, we are very happy that OpenAI adopted it & now even more people will continue to innovate on it. Products should use academic research!

However, from the pov of a grad student, it would have meant so much to us if there was attribution.

Thanks for all the love 🩷

To be clear, we are very happy that @OpenAI adopted it & now even more people will continue to innovate on it. Products should use academic research! However, from the pov of a grad student, it would have meant so much to us if there was attribution. Thanks for all the love 🩷
account_circle
Aditya Kusupati(@adityakusupati) 's Twitter Profile Photo

🤯WOW🪆Matryoshka Representation Learning enables 'native support for shortening embs' &'very flexible usage'

Jokes aside, excited that OpenAI serves MRL by default in v3 embedding API for retrieval & RAG!

Other models & services should catch-up soon😄

arxiv.org/abs/2205.13147

🤯WOW🪆Matryoshka Representation Learning enables 'native support for shortening embs' &'very flexible usage' Jokes aside, excited that @OpenAI serves MRL by default in v3 embedding API for retrieval & RAG! Other models & services should catch-up soon😄 arxiv.org/abs/2205.13147
account_circle
Abhinav Shukla(@Abhinav95_) 's Twitter Profile Photo

A Seattle icon and role model. I will always remember his infectious energy and leadership at a young age of 70. Thanks for the memories!

account_circle
Abhinav Shukla(@Abhinav95_) 's Twitter Profile Photo

Aditya is one of the best young researchers and an incredible mentor. He has been leading high impact work in representation learning for a while. You should absolutely hire him!

account_circle
Ishan Misra(@imisra_) 's Twitter Profile Photo

World meet
For the past year, our team has been pushing on video generation. The result? Emu Video that generates high quality videos from text or images. SOTA performance vs. commercial products and academic papers. Check it out emu-video.metademolab.com

account_circle
Jeff Dean (@🏡)(@JeffDean) 's Twitter Profile Photo

keveman Lucas Beyer (bl16) This is roughly right. Basically wanted to send fewer bytes over the network for our distributed neural network training system, and easiest way on a CPU was to lop off the low 16 bits of mantissa, and fill with 0s on other side. Turns out it was fine for training.

account_circle
Prateek Jain(@jainprateek_) 's Twitter Profile Photo

Aditya Kusupati is on the job market this year. If you are looking for a super smart, driven, and passionate researcher+leader, you should definitely contact him!
cc: Sham Kakade, UW RAIVN Lab, Inderjit Dhillon, Manish Gupta Jeff Dean (@🏡)

account_circle
Abhinav Shukla(@Abhinav95_) 's Twitter Profile Photo

This is one of my favorite works in a long time. A simple, elegant, and extremely impactful idea that you can use in as little as 5-6 lines of code in existing Transformer architectures. Well done Aditya Kusupati, Sneha Kudugunta and co!

account_circle
Aditya Kusupati(@adityakusupati) 's Twitter Profile Photo

Announcing MatFormer - a nested🪆(Matryoshka) Transformer that offers elasticity across deployment constraints.

MatFormer is an architecture that lets us use 100s of accurate smaller models that we never actually trained for!

arxiv.org/abs/2310.07707 1/9

Announcing MatFormer - a nested🪆(Matryoshka) Transformer that offers elasticity across deployment constraints. MatFormer is an architecture that lets us use 100s of accurate smaller models that we never actually trained for! arxiv.org/abs/2310.07707 1/9
account_circle
Scaled Foundations(@ScaFoAI) 's Twitter Profile Photo

Presenting AirGen, the next-gen aerial robotics simulator and evolution of ! AirGen adds several new features to synthesize data for aerial intelligence. Currently available in preview and free/unrestricted for academic purposes. Key features include: 🧵👇 (1/5)

account_circle