Humphrey Shi (@humphrey_shi) 's Twitter Profile
Humphrey Shi

@humphrey_shi

associate professor @ICatGT @MLatGT @GTComputing @GeorgiaTech l grad faculty @UOregon @UofIllinois | chief scientist @Picsart

ID: 540786666

linkhttps://www.humphreyshi.com calendar_today30-03-2012 10:20:29

278 Tweet

1,1K Followers

70 Following

Humphrey Shi (@humphrey_shi) 's Twitter Profile Photo

Kudos to Georgia Tech alumni @DROP_ALL_TABLES and Haicheng Wu for their cool work on the CUTLASS library NVIDIA, enabling efficient AI implementations like FlashAttention-3 and our recent FusedNA from Ali Hassani. CUTLASS is becoming a key layer for AI: github.com/NVIDIA/cutlass

Humphrey Shi (@humphrey_shi) 's Twitter Profile Photo

Generative AI is a powerful tool. Combined with Responsible AI we can create more positive societal impacts! Check out The New York Times on the efforts from Getty Images Picsart AI Research (PAIR) NVIDIA and more. The cover photo is my favorite from one of our earlier models. nytimes.com/2024/07/19/tec…

Generative AI is a powerful tool. Combined with Responsible AI we can create more positive societal impacts!

Check out <a href="/nytimes/">The New York Times</a> on the efforts from <a href="/GettyImages/">Getty Images</a> <a href="/PicsartAI/">Picsart AI Research (PAIR)</a> <a href="/nvidia/">NVIDIA</a> and more. The cover photo is my favorite from one of our earlier models.
nytimes.com/2024/07/19/tec…
Irfan Essa (@irrfaan) 's Twitter Profile Photo

Check out our paper on video generation entitled “VideoPoet: A large language model for zero-shot video generation”. Being presented today (July 23, 2024) at #ICML2024. sites.research.google/videopoet/ Also awarded the best paper award. Congrats to all my coauthors for this amazing work.

Humphrey Shi (@humphrey_shi) 's Twitter Profile Photo

Congratulations to our Georgia Tech School of Interactive Computing colleagues for their achievements at ICML Conference!🎉 Irfan Essa: Best Paper🏆 for VideoPoet with Google DeepMind (including several GT alumni at Google Atlanta) and Judy Hoffman: Test of Time🏅 for DeCAF! More from GT this year: sites.gatech.edu/research/icml-…

Jianwei Yang (@jw2yang4ai) 's Twitter Profile Photo

Hmm, it does read familiar, :), bring back to the memory when we work on SEEM and apply it to promotable video segmentation in a zero-shot manner. Wonder why they do not even mention our work at all.

Stanford AI Lab (@stanfordailab) 's Twitter Profile Photo

arXiv -> alphaXiv Students at Stanford have built alphaXiv, an open discussion forum for arXiv papers. alphaXiv You can post questions and comments directly on top of any arXiv paper by changing arXiv to alphaXiv in any URL!

Georgia Tech School of Interactive Computing (@icatgt) 's Twitter Profile Photo

We're excited to have Lynn Dombrowski, Naveena Karusala, and Cindy Lin join our faculty this fall! With their leadership and expertise, we will continue to create a positive social impact as we strive toward responsible computing. b.gatech.edu/3SDANN7

Horace He (@chhillee) 's Twitter Profile Photo

For too long, users have lived under the software lottery tyranny of fused attention implementations. No longer. Introducing FlexAttention, a new PyTorch API allowing for many attention variants to enjoy fused kernels in a few lines of PyTorch. pytorch.org/blog/flexatten… 1/10

For too long, users have lived under the software lottery tyranny of fused attention implementations. 

No longer. 

Introducing FlexAttention, a new PyTorch API allowing for many attention variants to enjoy fused kernels in a few lines of PyTorch.
pytorch.org/blog/flexatten…
1/10
AK (@_akhaliq) 's Twitter Profile Photo

Nvidia presents Eagle Exploring The Design Space for Multimodal LLMs with Mixture of Encoders discuss: huggingface.co/papers/2408.15… The ability to accurately interpret complex visual information is a crucial topic of multimodal large language models (MLLMs). Recent work indicates

Nvidia presents Eagle

Exploring The Design Space for Multimodal LLMs with Mixture of Encoders

discuss: huggingface.co/papers/2408.15…

The ability to accurately interpret complex visual information is a crucial topic of multimodal large language models (MLLMs). Recent work indicates
Humphrey Shi (@humphrey_shi) 's Twitter Profile Photo

Check out 🦅EAGLE, a new open-source vision-centric, high-resolution multimodal LLM with enhanced visual perception by exploring a mixture of vision encoders huggingface.co/spaces/NVEagle… A year of hard work by my student Min Georgia Tech Computing , in collaboration with NVIDIA and others🤝🎉

Check out 🦅EAGLE, a new open-source vision-centric, high-resolution multimodal LLM with enhanced visual perception by exploring a mixture of vision encoders huggingface.co/spaces/NVEagle…

A year of hard work by my student Min <a href="/gtcomputing/">Georgia Tech Computing</a> , in collaboration with <a href="/nvidia/">NVIDIA</a> and others🤝🎉
merve (@mervenoyann) 's Twitter Profile Photo

NVIDIA just dropped NVEagle 🦅 Super impressive vision language model that comes in 7B, 13B and 13B fine-tuned on chat, improved visual perception with MoE vision encoders 💬 Keep reading for details and links ⇓

NVIDIA just dropped NVEagle 🦅

Super impressive vision language model that comes in 7B, 13B and 13B fine-tuned on chat, improved visual perception with MoE vision encoders 💬

Keep reading for details and links ⇓
Humphrey Shi (@humphrey_shi) 's Twitter Profile Photo

We’ve open-sourced a new version of StreamingT2V, an advanced autoregressive method for generating high-quality, long videos with rich motion dynamics. github.com/Picsart-AI-Res… Thanks to the Picsart AI Research (PAIR) team for their outstanding work in democratizing AI! x.com/humphrey_shi/s…

Humphrey Shi (@humphrey_shi) 's Twitter Profile Photo

We just hosted a Student Conference Machine Learning at Georgia Tech, bringing together students/fac from across campus to build a stronger community. Grateful to all participants & proud of the incredible work showcased! It was a success cuz we ran out of food :) Hear our Atlanta AI Buzz Georgia Tech!🐝

Humphrey Shi (@humphrey_shi) 's Twitter Profile Photo

Fascinating to be reminded that image manipulation dates back to the 1800s, especially in today’s generative AI era. Big thanks to Walter Scheirer for an insightful talk at Machine Learning at Georgia Tech ! Check out his new book on the history of fake images and their social impact: sup.org/books/title/?i…

Humphrey Shi (@humphrey_shi) 's Twitter Profile Photo

Congratulations to Fei-Fei Li Justin Johnson and the entire team at World Labs on their exciting new adventure! The future of computer vision lies in how we understand, emulate, and interact with the visual world. Can’t wait to see the incredible things you’ll build for our world!🎆

Humphrey Shi (@humphrey_shi) 's Twitter Profile Photo

The Midwest has a quiet, unassuming charm—small towns, sunrises, sunsets, and hospitality that work their way under your skin. After years there, I’ve realized those moments stick with you. Visiting friends’ university reminded me, deep down, I’m still a Midwest person at heart.

The Midwest has a quiet, unassuming charm—small towns, sunrises, sunsets, and hospitality that work their way under your skin. After years there, I’ve realized those moments stick with you. Visiting friends’ university reminded me, deep down, I’m still a Midwest person at heart.
Humphrey Shi (@humphrey_shi) 's Twitter Profile Photo

What drives me to the office every morning is the incredible energy around Georgia Tech in Midtown Atlanta, surrounded by landmarks like IBM Tower and Google. The buzz fills me with hope that our future innovators might build the next Google/IBM right here in the Southeast US🐝