Milad Mohammadi (@_miladm_) 's Twitter Profile
Milad Mohammadi

@_miladm_

PyTorch @Meta Superintelligence Labs - Ex: @Google, @Stanford, @Nvidia, @Apple

ID: 788298072

linkhttp://cva.stanford.edu/people/milad/ calendar_today29-08-2012 01:59:35

123 Tweet

202 Followers

337 Following

Soumith Chintala (@soumithchintala) 's Twitter Profile Photo

Llama3 8B and 70B are out, with pretty exciting results! * The ~400B is still training but results already look promising. * Meta's own Chat interface is also live at meta.ai * TorchTune integration is shortly going live: github.com/pytorch/torcht…

Andrew Ng (@andrewyng) 's Twitter Profile Photo

Last week, I spoke about AI and regulations at an event at the U.S. Capitol attended by legislative and business leaders. I’m encouraged by the progress the open source community has made fending off regulations that would have stifled innovation. But opponents of open source are

Jeff Dean (@jeffdean) 's Twitter Profile Photo

Very nice results from a Google AI research effort on a general purpose time-series prediction model that gives good zero-shot performance to new forecasting tasks.

Milad Mohammadi (@_miladm_) 's Twitter Profile Photo

🚀 PyTorch TPU proud moment: vLLM TPU 🚀 At Google Cloud Next '25 keynote, Amin Vahdat announced vLLM on TPU. This is a HUGE breakthrough moment for TPU adoption! 🔥 Thank you Woosuk Kwon for your awesome contribution and collaboration.

🚀 PyTorch TPU proud moment: vLLM TPU 🚀

At Google Cloud Next '25 keynote, Amin Vahdat announced vLLM on TPU.

This is a HUGE breakthrough moment for TPU adoption! 🔥

Thank you <a href="/woosuk_k/">Woosuk Kwon</a> for your awesome contribution and collaboration.
a16z (@a16z) 's Twitter Profile Photo

.Fei-Fei Li on why LLMs will struggle to solve spatial intelligence: "Language is fundamentally a purely generated signal." "You don't go out in nature and there's words written in the sky for you." "There is a 3D world out there that follows laws of physics... to fundamentally

Thinking Machines (@thinkymachines) 's Twitter Profile Photo

Today Thinking Machines Lab is launching our research blog, Connectionism. Our first blog post is “Defeating Nondeterminism in LLM Inference” We believe that science is better when shared. Connectionism will cover topics as varied as our research is: from kernel numerics to

Today Thinking Machines Lab is launching our research blog, Connectionism. Our first blog post is “Defeating Nondeterminism in LLM Inference”

We believe that science is better when shared. Connectionism will cover topics as varied as our research is: from kernel numerics to
TestingCatalog News 🗞 (@testingcatalog) 's Twitter Profile Photo

BREAKING 🚨: Meta announced Meta Ray-Ban Display AI Glasses with an EMG Wristband! Did Zuck just kill the phone industry? 👀 Honestly, a wristband is a HUGE enabler, but there are significant questions about its quality.

Mira Murati (@miramurati) 's Twitter Profile Photo

Sharing our second Connectionism research post on Modular Manifolds, a mathematical approach to refining training at each layer of the neural network

Lilian Weng (@lilianweng) 's Twitter Profile Photo

GPUs are expensive and setting up the infrastructure to make GPUs work for you properly is complex, making experimentation on cutting-edge models challenging for researchers and ML practitioners. Providing high quality research tooling is one of the most effective ways to

GPUs are expensive and setting up the infrastructure to make GPUs work for you properly is complex, making experimentation on cutting-edge models challenging for researchers and ML practitioners. 

Providing high quality research tooling is one of the most effective ways to
maharshi (@mrsiipa) 's Twitter Profile Photo

it is common knowledge but i absolutely love how torch compile is able to fuse separate function calls doing some elementwise ops into a single triton kernel. it's like witnessing magic. we don't appreciate torch compile enough.

it is common knowledge but i absolutely love how torch compile is able to fuse separate function calls doing some elementwise ops into a single triton kernel. 

it's like witnessing magic. we don't appreciate torch compile enough.
Andrew Ng (@andrewyng) 's Twitter Profile Photo

Hanging out with Project Jupyter co-founder Brian Granger. If not for him and fernandoperez.org we wouldn’t have the coding notebooks we use daily in AI and Data Science. Very grateful to him and the whole Jupyter team for this wonderful open-source work!

Hanging out with Project Jupyter co-founder <a href="/ellisonbg/">Brian Granger</a>.  If not for him and <a href="/fperez_org/">fernandoperez.org</a> we wouldn’t have the coding notebooks we use daily in AI and Data Science. Very grateful to him and the whole Jupyter team for this wonderful open-source work!
Fei-Fei Li (@drfeifei) 's Twitter Profile Photo

It’s an honor to have received the Queen Elizabeth Prize for Engineering along with my fellow laureates! But it’s also a responsibility. AI’s impact to humanity is in the hands of all of us.

TIME (@time) 's Twitter Profile Photo

2025 was the year when artificial intelligence’s full potential roared into view, and when it became clear that there will be no turning back. For delivering the age of thinking machines, for wowing and worrying humanity, for transforming the present and transcending the

2025 was the year when artificial intelligence’s full potential roared into view, and when it became clear that there will be no turning back.

For delivering the age of thinking machines, for wowing and worrying humanity, for transforming the present and transcending the
Fei-Fei Li (@drfeifei) 's Twitter Profile Photo

This came as a total surprise this morning. Very humbled… 🙏 AI is built by generations of technologists, starting with the daring question of “can machines think?” by Alan Turing. It will be further developed, used and governed by many and all of us! Let’s keep our AI mission

PyTorch (@pytorch) 's Twitter Profile Photo

Today’s "Inside Helion Live Q&A" brought the #PyTorch community together with Jason Ansel, Oguz Ulgen, Wei (Will) Feng, and Jongsok Choi from Meta’s PyTorch Compiler and Helion teams. The discussion explored how Helion approaches kernel authoring, #AIInfrastructure performance, and

PyTorch (@pytorch) 's Twitter Profile Photo

Zhipeng (Jason) Wang, PhD (Zhipeng Wang 🇺🇦) explains how DeepSpeed supports ML training research and why joining PyTorch Foundation benefits researchers and developers working on AI training workloads. 🔗youtu.be/67719mlOSp0 #PyTorch #DeepSpeed #OpenSourceAI #AIInfrastructure

Andrej Karpathy (@karpathy) 's Twitter Profile Photo

Three days ago I left autoresearch tuning nanochat for ~2 days on depth=12 model. It found ~20 changes that improved the validation loss. I tested these changes yesterday and all of them were additive and transferred to larger (depth=24) models. Stacking up all of these changes,

Three days ago I left autoresearch tuning nanochat for ~2 days on depth=12 model. It found ~20 changes that improved the validation loss. I tested these changes yesterday and all of them were additive and transferred to larger (depth=24) models. Stacking up all of these changes,