Tom St. John (@tstjohn_hpcml) 's Twitter Profile
Tom St. John

@tstjohn_hpcml

Technical lead for MTIA training performance at Meta AI; Ph.D. from University of Delaware; Brazilian jiu-jitsu and boxing practitioner; cat owner

ID: 1164564510805413888

calendar_today22-08-2019 15:46:40

297 Tweet

212 Takipçi

2,2K Takip Edilen

Davis Blalock (@davisblalock) 's Twitter Profile Photo

"DataPerf: Benchmarks for Data-Centric AI Development" What if instead of holding the data constant and benchmarking different models, we held the model constant and benchmarked different data pipelines? [1/7]

"DataPerf: Benchmarks for Data-Centric AI Development"

What if instead of holding the data constant and benchmarking different models, we held the model constant and benchmarked different data pipelines? [1/7]
HPC Guru (on an extended break) (@hpc_guru) 's Twitter Profile Photo

#Exascale and beyond 'myths' to be debunked: Satoshi Matsuoka at #ModSim2022 💯agree on the need to debunk the Intel HPC propagated myth that the next goal beyond Exscale is Zettascale in 2027 #HPC #AI via Simon McIntosh-Smith

#Exascale and beyond 'myths' to be debunked: <a href="/ProfMatsuoka/">Satoshi Matsuoka</a> at #ModSim2022

💯agree on the need to debunk the <a href="/intelhpc/">Intel HPC</a> propagated myth that the next goal beyond Exscale is Zettascale in 2027

#HPC #AI via <a href="/simonmcs/">Simon McIntosh-Smith</a>
Andrej Karpathy (@karpathy) 's Twitter Profile Photo

!!!! Ok I recorded a (new!) 2h25m lecture on "The spelled-out intro to neural networks and backpropagation: building micrograd" youtube.com/watch?v=VMj-3S… . This is the culmination of about 8 years of obsessing about the best way to explain neural nets and backprop.

MLCommons (@mlcommons) 's Twitter Profile Photo

We are excited to release MedPerf platform, an open-source federated evaluation framework for AI model #benchmarking that promises to deliver new levels of clinical efficiency, prioritize patient privacy & mitigating legal and regulatory risks. medperf.org

MLCommons (@mlcommons) 's Twitter Profile Photo

Great session SC25 Nov 16 MLPerf: A Benchmark for M/L. Tom St John, Murali Emani, Geoffrey Fox, Oana Balmau, David Kanter, Piotr Luszczek discuss developments within #MLPerf org & update our recent MLPerf HPC benchmark results. sc22.supercomputing.org/presentation/?…

HPC Guru (on an extended break) (@hpc_guru) 's Twitter Profile Photo

For folks who did NOT register for #SC22, you can still watch Jack Dongarra's #TuringLecture on Youtube youtube.com/watch?v=ifVjPD… #HPC #AI

HPC Guru (on an extended break) (@hpc_guru) 's Twitter Profile Photo

#SC22: Reinventing High-Performance Computing Torsten Hoefler 🇨🇭 has an interesting take The network is the #supercomputer! The commoditization of #HPC is a strong sign of our impact We need to reinvent ourselves

#SC22: Reinventing High-Performance Computing

<a href="/thoefler/">Torsten Hoefler 🇨🇭</a> has an interesting take 

The network is the #supercomputer!

The commoditization of #HPC is a strong sign of our impact

We need to reinvent ourselves
Andrew Ng (@andrewyng) 's Twitter Profile Photo

ChatGPT is sometimes amazing, and sometimes hilariously wrong. Here, it confidently explains to me why an abacus is faster than a GPU. 😃

ChatGPT is sometimes amazing, and sometimes hilariously wrong. Here, it confidently explains to me why an abacus is faster than a GPU. 😃
Xavier Bresson (@xbresson) 's Twitter Profile Photo

Our paper "Benchmarking Graph Neural Networks" has been accepted for publication at Journal of Machine Learning Research Journal of Machine Learning Research! arxiv.org/pdf/2003.00982… (after rejection from NeurIPS, ICLR and ICML :)

Our paper "Benchmarking Graph Neural Networks" has been accepted for publication at Journal of Machine Learning Research <a href="/JmlrOrg/">Journal of Machine Learning Research</a>! 
arxiv.org/pdf/2003.00982…

(after rejection from NeurIPS, ICLR and ICML :)
Vivek Natarajan (@vivnat) 's Twitter Profile Photo

Delighted to share our new Google Health Google AI @Deepmind paper at the intersection of LLMs + health. Our LLMs building on Flan-PaLM reach SOTA on multiple medical question answering datasets including 67.6% on MedQA USMLE (+17% over prior work). arxiv.org/abs/2212.13138

Delighted to share our new <a href="/GoogleHealth/">Google Health</a> <a href="/GoogleAI/">Google AI</a> @Deepmind paper at the intersection of LLMs + health.

Our LLMs building on Flan-PaLM reach SOTA on multiple medical question answering datasets including 67.6% on MedQA USMLE (+17% over prior work).

arxiv.org/abs/2212.13138
Horace He (@chhillee) 's Twitter Profile Photo

Let's talk about a detail that occurs during PyTorch 2.0's codegen - tiling. In many cases, tiling is needed to generate efficient kernels. Even for something as basic as torch.add(A, B), you might need tiling to be efficient! But what is tiling? And when is it needed? (1/13)

Let's talk about a detail that occurs during PyTorch 2.0's codegen - tiling.

In many cases, tiling is needed to generate efficient kernels. Even for something as basic as torch.add(A, B), you might need tiling to be efficient! But what is tiling? And when is it needed? 

(1/13)
Abdulrahman Mahmoud (@arhmahmoud) 's Twitter Profile Photo

Call for applications for the inaugural Machine Learning and Systems Rising Stars 2023 workshop! Website: sites.google.com/g.harvard.edu/…

Tom St. John (@tstjohn_hpcml) 's Twitter Profile Photo

If you're planning to attend Hot Chips, come check out our ML inference tutorial on Sunday. We've got a great line-up of speakers from NVIDIA AI, Berkeley AI Research, Qualcomm, @MetaAI, and Moffett AI. #HotChips35 hotchips.org

If you're planning to attend <a href="/hotchipsorg/">Hot Chips</a>, come check out our ML inference tutorial on Sunday.  We've got a great line-up of speakers from <a href="/NVIDIAAI/">NVIDIA AI</a>, <a href="/berkeley_ai/">Berkeley AI Research</a>, <a href="/Qualcomm/">Qualcomm</a>, @MetaAI, and Moffett AI.

#HotChips35
hotchips.org
Sharon Zhou (@realsharonzhou) 's Twitter Profile Photo

Excited to announce a HUGE secret with Lisa Su: Lamini has been building LLMs on AMD GPUs *in production* for over the past year! We’ve made running LLMs on AMD super easy and a highly competitive option through our LLM Superstation, available now at ~10x lower cost than

Google AI (@googleai) 's Twitter Profile Photo

Today on the blog, learn how we’re supporting a new effort by the non-profit MLCommons Association that aims to bring together expert researchers across academia and industry to develop standard AI safety benchmarks that everyone can use and understand. ↓ goo.gle/3QtydZf

MLCommons (@mlcommons) 's Twitter Profile Photo

SC23 attendees join MLCommons BOF sessions to add your voice to the MLCommons community. Wed, 11/15, 5:15pm in Rm 601-603 MLPerf: A Benchmark for Machine Learning, or in Rm 702 join the conversation around the Future of Benchmarks in Supercomputing. sc23.supercomputing.org/program/ #SC23

SC23 attendees join MLCommons BOF sessions to add your voice to the <a href="/MLCommons/">MLCommons</a> community. Wed, 11/15, 5:15pm in Rm 601-603 MLPerf: A Benchmark for Machine Learning, or in Rm 702 join the conversation around the Future of Benchmarks in Supercomputing. sc23.supercomputing.org/program/ #SC23
MLCommons (@mlcommons) 's Twitter Profile Photo

Introducing the AlgoPerf: Training Algorithms Benchmark! Compete for a share of the $50,000 prize pool by submitting more effective and efficient neural network training algorithms. Learn more mlcommons.org/2023/11/mlc-al… #Algorithms #MachineLearning #Competition

Demis Hassabis (@demishassabis) 's Twitter Profile Photo

The Gemini era is here. Thrilled to launch Gemini 1.0, our most capable & general AI model. Built to be natively multimodal, it can understand many types of info. Efficient & flexible, it comes in 3 sizes each best-in-class & optimized for different uses blog.google/technology/ai/…

The Gemini era is here. Thrilled to launch Gemini 1.0, our most capable &amp; general AI model. Built to be natively multimodal, it can understand many types of info. Efficient &amp; flexible, it comes in 3 sizes each best-in-class &amp; optimized for different uses blog.google/technology/ai/…