Humphrey Shi (@humphrey_shi) Twitter Tweets • TwiCopy


Haicheng Wu

@asdf1234_0

6 months ago

CUTLASS is at the center of the CUDA Blackwell release blog. As always, we work hand in hand with the CUDA team to deliver next-level performance. developer.nvidia.com/blog/cuda-tool…

Likes: 125 · Replies: 1 · Reposts: 25

Great work by Google DeepMind on Matryoshka Quantization! Back in 2019, we introduced Any-Precision DNNs, enabling a single deep learning model to dynamically support any bit-widths without retraining. Excited to see how these ideas help Gemini/LLMs! arxiv.org/abs/1911.07346

Likes: 121 · Replies: 3 · Reposts: 10

Bing Xu

@bingxu_

6 months ago

Likes: 42 · Replies: 2 · Reposts: 4

Danfei Xu

@danfei_xu

6 months ago

Thrilled to share this story covering our collaboration with Project Aria @Meta Reality Labs at Meta! Human data is robot data in disguise. Imitation learning is human modeling. We are at the beginning of something truly revolutionary, both for robotics and human-level AI beyond language.

Likes: 165 · Replies: 2 · Reposts: 19

Jianwei Yang

@jw2yang4ai

6 months ago

Thanks for featuring our work! Aran Komatsuzaki. 🔥Today we are thrilled to announce our MSR flagship project Magma! This is a fully open-sourced project. We will roll out all the stuff: code, model and training data through the following days. Check out our full work here:

Likes: 185 · Replies: 7 · Reposts: 38

Ali Hassani

@alihassanijr

6 months ago

Come see my presentation on Distributed GEMM in GPU MODE tomorrow!

Likes: 22 · Replies: 0 · Reposts: 1

Humphrey Shi

@humphrey_shi

5 months ago

Five months after Hurricane Helene, we’re finally starting to rebuild our home. Grateful for my family’s resilience through this journey 🙏 Meanwhile, AI’s rapid progress feels like a hurricane, reshaping everything in its path. Perhaps we, too, should rebuild—forward & onward.

Likes: 36 · Replies: 3 · Reposts: 0

Association for Computing Machinery

@theofficialacm

5 months ago

Meet the recipients of the 2024 ACM A.M. Turing Award, Andrew G. Barto and Richard S. Sutton! They are recognized for developing the conceptual and algorithmic foundations of reinforcement learning. Please join us in congratulating the two recipients! bit.ly/4hpdsbD

Likes: 1.1K · Replies: 35 · Reposts: 479

Humphrey Shi

@humphrey_shi

5 months ago

Congrats to Profs. Barto & Sutton on the Turing Award! 🎉 Barto’s journey is inspiring. I happen to be teaching the McCulloch-Pitts neuron to hundreds of Georgia Tech undergrads today in my comp vision class. Makes me wonder what breakthroughs our next-gen leaders will achieve in 50 years🚀

Likes: 8 · Replies: 0 · Reposts: 0

Humphrey Shi

@humphrey_shi

5 months ago

Check out WorldModelBench, our first workshop on Benchmarking World Models at #CVPR2025, led by researchers from NVIDIA and beyond. Explore benchmarks, evaluation metrics, downstream tasks, & safety for World Models. Call for papers now open till April 7th: worldmodelbench.github.io

Likes: 61 · Replies: 0 · Reposts: 15

Humphrey Shi

@humphrey_shi

5 months ago

Video/Physics Generative AI was bottlenecked by diffusion runtime: 5s used to take minutes. My student Ali Hassani (Georgia Tech Computing) helped scale the full 35-step Cosmos 7B DiT 40× to real-time on Blackwell NVL72, in collab w/ Ming-Yu Liu’s team at NVIDIA. Congrats, just the beginning!🐝🚀

Likes: 74 · Replies: 1 · Reposts: 8

Devi Parikh

@deviparikh

4 months ago

One year ago, Abhishek Das and I left Meta to start Yutori. Ten months ago, Dhruv Batra joined us :) Nine months ago, we crystallized our vision. Two months ago, we released a sneak peek into what we’ve been building. Today, can’t be more excited to fully unveil Yutori’s

Likes: 283 · Replies: 16 · Reposts: 37

Humphrey Shi

@humphrey_shi

4 months ago

Check out Slow-Fast Video MLLM — a new paradigm to empower multi-modal LLMs with longer video context and finer spatial detail! 🎥🧠 🔗 github.com/SHI-Labs/Slow-… Led by 🐝 Min Min Shi from Georgia Tech Computing, in collaboration with NVIDIA Zhiding Yu and more 🤝

Likes: 124 · Replies: 1 · Reposts: 21

Humphrey Shi

@humphrey_shi

4 months ago

Huge congrats to Jiahui Yu and Bowen Cheng—our young and talented former IFP lab alumni—for their amazing work on GPT-4.1 at OpenAI.🎉 Proud and inspired — as one of the old guards watching the next wave take flight. The future clearly belongs to the young. Onward and upward! 🚀

Likes: 10 · Replies: 0 · Reposts: 0

Humphrey Shi

@humphrey_shi

4 months ago

After nearly 5 incredible years, I’ve stepped down from my role as Chief Scientist at Picsart. Grateful for the journey—from building AI Research from scratch to a global team creating products used by millions every day✨ Now exploring what’s next in multimodal AI🚀. DMs open🤝

Likes: 36 · Replies: 1 · Reposts: 0

Bowen Cheng

@bowenc0221

4 months ago

"Thinking with Images" is what we have been cooking since GPT-4o launched last year, and it marks a paradigm shift in how we view/solve perception problems in this new era of RL. It is such a pleasure and an honor to work with this amazing team to get it out!

Likes: 130 · Replies: 3 · Reposts: 7

Humphrey Shi

@humphrey_shi

4 months ago

Impressed by FramePack from style2paints & Maneesh Agrawala! Their table puts our StreamingT2V (Mar 2024) at #2 overall and 🥇 in motion (99.96%). A nice reminder that memory blocks still matter, and may fruitfully complement token-compression and other approaches for marathon vids!🏃

Likes: 22 · Replies: 1 · Reposts: 1

Ali Hassani

@alihassanijr

4 months ago

Wondering what's happening with NATTEN in 2025? Check out Generalized Neighborhood Attention! Spoiler: NATTEN gets a new stride parameter, we made a simulator for all your analytical studies, AND a Blackwell kernel! Keep reading for more... (1 / 5)

Likes: 24 · Replies: 1 · Reposts: 6

Humphrey Shi

@humphrey_shi

3 months ago

A paper from my PhD students—nearly a year of work—was rejected by ICML Conference despite 4 weak accepts, citing “calibration with other submissions.” Still incredibly proud of my students. To young researchers: rejections happen. Keep learning, keep going—the real judge is within.

Likes: 64 · Replies: 2 · Reposts: 2