Mike Shou (@mikeshou1) 's Twitter Profile
Mike Shou

@mikeshou1

Asst Prof at NUS. Forbes 30 under 30 Asia. Previously at Facebook AI and Columbia U. Passionate about video, multi-modal, AI assistant.

ID: 1336190898665705472

Link: https://sites.google.com/view/showlab · Joined: 08-12-2020 06:08:39

188 Tweets

1.1K Followers

436 Following

Kevin Lin (@kevinqhlin) 's Twitter Profile Photo

Hi friends! Attending ICLR 2025 in Singapore? Interested in visiting the NUS campus? 🎉 Join us for the Multimodal Gathering Workshop @ NUS! 🙌 A half-day meetup for researchers and students to exchange ideas, showcase work, and explore the latest in multimodal AI.

YUCHAO GU (@yuchaogu) 's Twitter Profile Photo

Our previous work, FAR, proposes a next-frame prediction paradigm based on long short-term context modeling with asymmetric patchification.

Paper: arxiv.org/abs/2503.19325
Code: github.com/showlab/FAR

Glad to see this idea adopted and extended in FramePack, demonstrating
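
Purely as an illustration of the long short-term context idea named above, here is a minimal sketch of asymmetric patchification: recent frames are tokenized with small patches (many tokens) while older context frames use large patches (few tokens), so long-range context stays cheap. The patch sizes, module names, and shapes below are assumptions for this sketch, not FAR's actual implementation.

```python
import torch
import torch.nn as nn

class AsymmetricPatchify(nn.Module):
    """Illustrative sketch: tokenize recent frames finely and older
    context frames coarsely, so long-range context costs fewer tokens.
    Patch sizes and the split point are made-up values, not FAR's."""

    def __init__(self, dim=256, fine_patch=2, coarse_patch=8, channels=3):
        super().__init__()
        # Conv-based patch embedding: kernel == stride == patch size.
        self.fine = nn.Conv2d(channels, dim, kernel_size=fine_patch, stride=fine_patch)
        self.coarse = nn.Conv2d(channels, dim, kernel_size=coarse_patch, stride=coarse_patch)

    def forward(self, frames, num_recent=4):
        # frames: (T, C, H, W), ordered oldest -> newest
        long_ctx, short_ctx = frames[:-num_recent], frames[-num_recent:]
        coarse_tok = self.coarse(long_ctx).flatten(2).transpose(1, 2)  # few tokens per old frame
        fine_tok = self.fine(short_ctx).flatten(2).transpose(1, 2)     # many tokens per recent frame
        # Concatenate into one sequence; a causal transformer would then
        # predict the next frame's tokens from this mixed-resolution context.
        return torch.cat([coarse_tok.flatten(0, 1), fine_tok.flatten(0, 1)], dim=0)

tokens = AsymmetricPatchify()(torch.randn(16, 3, 64, 64))
print(tokens.shape)  # (sequence_length, 256)
```
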
Mike Shou (@mikeshou1) 's Twitter Profile Photo

Insightful keynote from Violet Peng at SSNLP about creativity and control — To fulfill a meaningful goal in a constrained setting, one naturally has to be innovative! can’t agree more

AK (@_akhaliq) 's Twitter Profile Photo

LiveCC just dropped on Hugging Face: “Learning Video LLM with Streaming Speech Transcription at Scale”, a video LLM capable of real-time commentary, trained with a novel video-ASR streaming method, SOTA on both streaming and offline benchmarks.

Wenhao Chai (@wenhaocha1) 's Twitter Profile Photo

🎉 We’re excited to host two challenges at LOVE: Multimodal Video Agent Workshop at CVPR 2025, advancing the frontier of video-language understanding! #CVPR2025

📌 Track 1A: [VDC] Video Detailed Captioning Challenge
Generate rich and structured captions that cover multiple

Victor.Kai Wang (@victorkaiwang1) 's Twitter Profile Photo

Customizing your LLMs in seconds using prompts 🥳! Excited to share our latest work with HPC-AI Lab, VITA Group, Konstantin Schürholt, Yang You, Michael Bronstein, Damian Borth: Drag-and-Drop LLMs (DnD). Two features: tuning-free, and comparable or even better than full-shot tuning. (🧵1/8)

Yuxin Jiang (@jyuxinn) 's Twitter Profile Photo

🚀A new way to use diffusion models for style transfer!

Style Matching Score (SMS) is accepted to #ICCV2025🌺
We reframe image stylization as a style distribution matching problem.

-Paper: arxiv.org/abs/2503.07601
-Code: github.com/showlab/SMS
-Project: yuxinn-j.github.io/projects/SMS.h…
Junyu Xie (@junyuxiearthur) 's Twitter Profile Photo

Movies are more than just video clips, they are stories! 🎬

We’re hosting the 1st SLoMO Workshop at #ICCV2025 to discuss Story-Level Movie Understanding & Audio Descriptions!

Website: slomo-workshop.github.io
Competition: huggingface.co/spaces/SLoMO-W…
Kevin Lin (@kevinqhlin) 's Twitter Profile Photo

Glad to see GPT-5’s dynamic router balances fast answers and deep reasoning based on query complexity and user intent.

In our recent work “Think-or-Not”: arxiv.org/pdf/2505.16854
We study this adaptive reasoning and introduce an easy-to-follow “thought-dropout” mechanism that
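
As a rough illustration of what a thought-dropout style of training signal could look like: with some probability the reasoning span is stripped from the training target, so the model also sees direct-answer supervision and can learn when thinking is unnecessary. The tag format, dropout rate, and helper below are assumptions for this sketch, not the paper's actual recipe.

```python
import random
import re

def thought_dropout(target: str, p: float = 0.5) -> str:
    """Illustrative thought-dropout: with probability p, remove the
    reasoning span from a training target so the model also learns to
    answer directly. The <think>...</think> tags and the dropout rate
    are assumptions, not the paper's exact format."""
    if random.random() < p:
        return re.sub(r"<think>.*?</think>\s*", "", target, flags=re.DOTALL)
    return target

example = "<think>The area is pi * r^2 with r = 3, so 9*pi.</think> The answer is 9π."
print(thought_dropout(example, p=1.0))  # "The answer is 9π."
```
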
Michael Qizhe Shieh (@mpulsewidth) 's Twitter Profile Photo

Introducing MCPMark, a collaboration with Eval Sys and LobeHub!

We created a challenging benchmark to stress-test MCP use in comprehensive contexts.
- 127 high-quality data samples created by experts.
- GPT-5 takes the current lead and achieves a Pass@1 of 46.96% while the
Mike Shou (@mikeshou1) 's Twitter Profile Photo

Attended the Gemini event earlier this week; always enjoy the talks from legendary Google pioneers 👍 One fun fact learned: today, many ML people work on NLP, many NLP people work on vision-language, and many vision people work on robots 😆

Xavier Bresson (@xbresson) 's Twitter Profile Photo

Academia invents the sparks, e.g. attention, GANs, diffusion models. Industry grows them into societal impact, such as AlphaFold, LLMs, GenAI. If you want to explore 1,000 ideas, go to academia; if you want to scale up a few promising ideas, go to industry.

Jinheng Xie (@sierkinhane1) 's Twitter Profile Photo

Big thanks to Zhenheng and my advisor Mike Shou for their support 🙏 Our unified multimodal model “Show-o2” got accepted to NeurIPS 2025! See you in San Diego 👋 #NeurIPS2025

Code is open-sourced & maintained here: github.com/showlab/Show-o
Dima Damen (@dimadamen) 's Twitter Profile Photo

As an SAC for #NeurIPS2025, I don't agree with the PCs' approach of rejecting papers based on ranking. I ranked papers as requested and explicitly stated that I support the acceptance of all papers. I wasn't given an explanation of why papers at the end of the ranking were rejected.