Humphrey Shi (@humphrey_shi) 's Twitter Profile
Humphrey Shi

@humphrey_shi

Associate Professor @GeorgiaTech | Building high-performance multimodal AI systems to empower creativity in the service of humanity.

ID: 540786666

linkhttps://www.humphreyshi.com calendar_today30-03-2012 10:20:29

328 Tweet

2,2K Takipçi

36 Takip Edilen

Haicheng Wu (@asdf1234_0) 's Twitter Profile Photo

CUTLASS is in the center of the CUDA Blackwell release blog. As always, we work hand in hand with CUDA team to deliver the next level performance. developer.nvidia.com/blog/cuda-tool…

Humphrey Shi (@humphrey_shi) 's Twitter Profile Photo

Great work by Google DeepMind on Matryoshka Quantization! Back in 2019, we introduced Any-Precision DNNs, enabling a single deep learning model to dynamically support any bit-widths without retraining. Excited to see how these ideas help Gemini/LLMs! arxiv.org/abs/1911.07346

Great work by <a href="/GoogleDeepMind/">Google DeepMind</a> on Matryoshka Quantization!

Back in 2019, we introduced Any-Precision DNNs, enabling a single deep learning model to dynamically support any bit-widths without retraining. Excited to see how these ideas help Gemini/LLMs!

arxiv.org/abs/1911.07346
Danfei Xu (@danfei_xu) 's Twitter Profile Photo

Thrilled to share this story covering our collaboration with Project Aria @Meta Reality Labs at Meta ! Human data is robot data in disguise. Imitation learning is human modeling. We are at the beginning of something truly revolutionary, both for robotics and human-level AI beyond language.

Jianwei Yang (@jw2yang4ai) 's Twitter Profile Photo

Thanks for featuring our work! Aran Komatsuzaki. 🔥Today we are thrilled to announce our MSR flagship project Magma! This is a fully open-sourced project. We will roll out all the stuff: code, model and training data through the following days. Check out our full work here:

Humphrey Shi (@humphrey_shi) 's Twitter Profile Photo

Five months after Hurricane Helene, we’re finally starting to rebuild our home. Grateful for my family’s resilience through this journey 🙏 Meanwhile, AI’s rapid progress feels like a hurricane, reshaping everything in its path. Perhaps we, too, should rebuild—forward & onward.

Five months after Hurricane Helene, we’re finally starting to rebuild our home. Grateful for my family’s resilience through this journey 🙏

Meanwhile, AI’s rapid progress feels like a hurricane, reshaping everything in its path. Perhaps we, too, should rebuild—forward &amp; onward.
Association for Computing Machinery (@theofficialacm) 's Twitter Profile Photo

Meet the recipients of the 2024 ACM A.M. Turing Award, Andrew G. Barto and Richard S. Sutton! They are recognized for developing the conceptual and algorithmic foundations of reinforcement learning. Please join us in congratulating the two recipients! bit.ly/4hpdsbD

Humphrey Shi (@humphrey_shi) 's Twitter Profile Photo

Congrats to Prof Barto & Sutton on Turing Award! 🎉 Barto’s journey is inspiring—I happen to be teaching McCulloch-Pitts neuron to hundreds of Georgia Tech undergrads today in my comp vision class.Makes me wonder what breakthroughs our next-gen leaders will achieve in 50 years🚀

Congrats to Prof Barto &amp; Sutton on Turing Award! 🎉

Barto’s journey is inspiring—I happen to be teaching McCulloch-Pitts neuron to hundreds of <a href="/GeorgiaTech/">Georgia Tech</a> undergrads today in my comp vision class.Makes me wonder what breakthroughs our next-gen leaders will achieve in 50 years🚀
Humphrey Shi (@humphrey_shi) 's Twitter Profile Photo

Check out WorldModelBench, our first workshop on Benchmarking World Models #CVPR2025, lead by researchers from NVIDIA and beyond. Explore benchmarks, evaluation metrics, downstream tasks, & safety for World Models. Call for papers now open till April 7th: worldmodelbench.github.io

Check out WorldModelBench, our first workshop on Benchmarking World Models <a href="/CVPR/">#CVPR2025</a>, lead by researchers from <a href="/nvidia/">NVIDIA</a>  and beyond.

Explore benchmarks, evaluation metrics, downstream tasks, &amp; safety for World Models. Call for papers now open till April 7th: worldmodelbench.github.io
Humphrey Shi (@humphrey_shi) 's Twitter Profile Photo

Video/Physics Generative AI was bottlenecked by diffusion runtime— 5s used to take minutes. My student Ali Hassani Georgia Tech Computing helped scale full 35-step Cosmos 7B DiT 40× to real-time on Blackwell NVL72, in collab w/ NVIDIA Ming-Yu Liu’s team. Congrats—just the beginning!🐝🚀

Video/Physics Generative AI was bottlenecked by diffusion runtime— 5s used to take minutes.
My student <a href="/AliHassaniJr/">Ali Hassani</a> <a href="/gtcomputing/">Georgia Tech Computing</a> helped scale full 35-step Cosmos 7B DiT 40× to real-time on Blackwell NVL72, in collab w/ <a href="/nvidia/">NVIDIA</a> <a href="/liu_mingyu/">Ming-Yu Liu</a>’s team. Congrats—just the beginning!🐝🚀
Devi Parikh (@deviparikh) 's Twitter Profile Photo

One year ago, Abhishek Das and I left Meta to start Yutori. Ten months ago, Dhruv Batra joined us :) Nine months ago, we crystallized our vision. Two months ago, we released a sneak peak into what we’ve been building. Today, can’t be more excited to fully unveil Yutori’s

One year ago, <a href="/abhshkdz/">Abhishek Das</a> and I left Meta to start Yutori. 
Ten months ago, <a href="/DhruvBatraDB/">Dhruv Batra</a> joined us :)
Nine months ago, we crystallized our vision.
Two months ago, we released a sneak peak into what we’ve been building.

Today, can’t be more excited to fully unveil <a href="/yutori_ai/">Yutori</a>’s
Humphrey Shi (@humphrey_shi) 's Twitter Profile Photo

Check out Slow-Fast Video MLLM — a new paradigm to empower multi-modal LLMs with longer video context and finer spatial detail! 🎥🧠 🔗 github.com/SHI-Labs/Slow-… Led by 🐝 Min Min Shi from Georgia Tech Computing, in collaboration with NVIDIA Zhiding Yu and more 🤝

Check out Slow-Fast Video MLLM — a new paradigm to empower multi-modal LLMs with longer video context and finer spatial detail! 🎥🧠

🔗 github.com/SHI-Labs/Slow-…

Led by 🐝 Min <a href="/__flying_lynx__/">Min Shi</a> from <a href="/gtcomputing/">Georgia Tech Computing</a>, in collaboration with <a href="/nvidia/">NVIDIA</a> <a href="/ZhidingYu/">Zhiding Yu</a> and more 🤝
Humphrey Shi (@humphrey_shi) 's Twitter Profile Photo

Huge congrats to Jiahui Yu and Bowen Cheng—our young and talented former IFP lab alumni—for their amazing work on GPT-4.1 at OpenAI.🎉 Proud and inspired — as one of the old guards watching the next wave take flight. The future clearly belongs to the young. Onward and upward! 🚀

Huge congrats to <a href="/jhyuxm/">Jiahui Yu</a> and <a href="/bowenc0221/">Bowen Cheng</a>—our young and talented former IFP lab alumni—for their amazing work on GPT-4.1 at OpenAI.🎉
Proud and inspired — as one of the old guards watching the next wave take flight. The future clearly belongs to the young. Onward and upward! 🚀
Humphrey Shi (@humphrey_shi) 's Twitter Profile Photo

After nearly 5 incredible years, I’ve stepped down from my role as Chief Scientist at Picsart. Grateful for the journey—from building AI Research from scratch to a global team creating products used by millions every day✨ Now exploring what’s next in multimodal AI🚀. DMs open🤝

Bowen Cheng (@bowenc0221) 's Twitter Profile Photo

"Thinking with Images" is what we have been cooking after GPT-4o launched last year and it marks a paradigm shift in how we view/solve perception problems in this new era of RL. It is such a pleasant and an honor to work with this amazing team to get it out!

Humphrey Shi (@humphrey_shi) 's Twitter Profile Photo

Impressed by FramePack from style2paints & Maneesh Agrawala! Their table puts our StreamingT2V (Mar 2024) at #2 overall and 🥇 in motion (99.96 %). A nice reminder that memory blocks still matter—and may fruitfully complement token‑compression and other approaches for marathon vids!🏃

Impressed by FramePack from <a href="/lvminzhang/">style2paints</a> &amp; <a href="/magrawala/">Maneesh Agrawala</a>! 
Their table puts our StreamingT2V (Mar 2024)  at #2 overall and 🥇 in motion (99.96 %). A nice reminder that memory blocks still matter—and may fruitfully complement token‑compression and other approaches for marathon vids!🏃
Ali Hassani (@alihassanijr) 's Twitter Profile Photo

Wondering what's happening with NATTEN in 2025? Check out Generalized Neighborhood Attention! Spoiler: NATTEN gets a new stride parameter, we made a simulator for all your analytical studies, AND a Blackwell kernel! Keep reading for more... (1 / 5)

Wondering what's happening with NATTEN in 2025?
Check out Generalized Neighborhood Attention!

Spoiler: NATTEN gets a new stride parameter, we made a simulator for all your analytical studies, AND a Blackwell kernel!

Keep reading for more...

(1 / 5)
Humphrey Shi (@humphrey_shi) 's Twitter Profile Photo

A paper from my PhD students—nearly a year of work—was rejected by ICML Conference despite 4 weak accepts, citing “calibration with other submissions.” Still incredibly proud of my students. To young researchers: rejections happen. Keep learning, keep going—the real judge is within.

A paper from my PhD students—nearly a year of work—was rejected by <a href="/icmlconf/">ICML Conference</a> despite 4 weak accepts, citing “calibration with other submissions.”

Still incredibly proud of my students. To young researchers: rejections happen. Keep learning, keep going—the real judge is within.