Kai Han (@kaihan_vis) 's Twitter Profile
Kai Han

@kaihan_vis

Computer Vision || Asst. Prof. @HKUniversity | Ex-{Researcher @GoogleAI, Asst. Prof. @BristolUni, Postdoc @Oxford_VGG, @UniofOxford}

ID: 1022225339118964736

linkhttps://www.kaihan.org/ calendar_today25-07-2018 21:01:27

238 Tweet

1,1K Takipรงi

563 Takip Edilen

Jianyuan Wang (@jianyuan_wang) 's Twitter Profile Photo

VGGT has been re-licensed to allow commercial usage. Enjoy the gift ๐Ÿ˜‰๐Ÿ˜‡ ๐Ÿ‘‰ huggingface.co/facebook/VGGT-โ€ฆ

VGGT has been re-licensed to allow commercial usage. Enjoy the gift ๐Ÿ˜‰๐Ÿ˜‡

๐Ÿ‘‰ huggingface.co/facebook/VGGT-โ€ฆ
Samuel Albanie ๐Ÿ‡ฌ๐Ÿ‡ง (@samuelalbanie) 's Twitter Profile Photo

We just shipped Gemini 2.5 Deep Think it doesn't just recall research papers - it fuses ideas across papers in ways I haven't seen before this level of capability demands careful evaluation model card below ๐Ÿ‘‡

We just shipped Gemini 2.5 Deep Think

it doesn't just recall research papers - it fuses ideas across papers in ways I haven't seen before

this level of capability demands careful evaluation

model card below ๐Ÿ‘‡
Manling Li (@manlingli_) 's Twitter Profile Photo

All week during rebuttals, I have started each day with the same reminder: stay humble, stay kind, don't let this turn me mean. When I was doing PhD, reviewers never felt this mean. There is a bright-eyed student sitting on the other side, and such reviews will destroy

Piotr Bojanowski (@p_bojanowski) 's Twitter Profile Photo

I am happy to share the work of our team. The outcome of a collaborative effort, by a joyful group of skilled and determined scientists and engineers! Congrats to the team on this amazing milestone!

Kai Han (@kaihan_vis) 's Twitter Profile Photo

๐Ÿ†๐Ÿ†๐Ÿ†Clash of the Titans (GPT 5 vs. Gemini 2.5 Pro) on GAMEBoT: Connect4-->11:8 Checkers-->20:0 #GPT5,#Gemini

Ragav Sachdeva (@ragavsachdeva) 's Twitter Profile Photo

๐Ÿšจ๐Ÿšจ๐ŸšจDeadline <10 days ๐Ÿšจ๐Ÿšจ๐Ÿšจ Submit your extended abstracts on AI driven comic analysis to the COMIQ workshop #ICCV2025 comiq-iccv25.github.io

Zhengzhong Tu (@_vztu) 's Twitter Profile Photo

๐Ÿš€ Excited to share that our paper "๐—ฅ๐—ฒ-๐—”๐—น๐—ถ๐—ด๐—ป: ๐—”๐—น๐—ถ๐—ด๐—ป๐—ถ๐—ป๐—ด ๐—ฉ๐—ถ๐˜€๐—ถ๐—ผ๐—ป ๐—Ÿ๐—ฎ๐—ป๐—ด๐˜‚๐—ฎ๐—ด๐—ฒ ๐— ๐—ผ๐—ฑ๐—ฒ๐—น๐˜€ ๐˜ƒ๐—ถ๐—ฎ ๐—ฅ๐—ฒ๐˜๐—ฟ๐—ถ๐—ฒ๐˜ƒ๐—ฎ๐—น-๐—”๐˜‚๐—ด๐—บ๐—ฒ๐—ป๐˜๐—ฒ๐—ฑ ๐——๐—ถ๐—ฟ๐—ฒ๐—ฐ๐˜ ๐—ฃ๐—ฟ๐—ฒ๐—ณ๐—ฒ๐—ฟ๐—ฒ๐—ป๐—ฐ๐—ฒ ๐—ข๐—ฝ๐˜๐—ถ๐—บ๐—ถ๐˜‡๐—ฎ๐˜๐—ถ๐—ผ๐—ป" has been accepted to EMNLP 2025 (Main Track)! ๐ŸŽ‰ Large

๐Ÿš€ Excited to share that our paper "๐—ฅ๐—ฒ-๐—”๐—น๐—ถ๐—ด๐—ป: ๐—”๐—น๐—ถ๐—ด๐—ป๐—ถ๐—ป๐—ด ๐—ฉ๐—ถ๐˜€๐—ถ๐—ผ๐—ป ๐—Ÿ๐—ฎ๐—ป๐—ด๐˜‚๐—ฎ๐—ด๐—ฒ ๐— ๐—ผ๐—ฑ๐—ฒ๐—น๐˜€ ๐˜ƒ๐—ถ๐—ฎ ๐—ฅ๐—ฒ๐˜๐—ฟ๐—ถ๐—ฒ๐˜ƒ๐—ฎ๐—น-๐—”๐˜‚๐—ด๐—บ๐—ฒ๐—ป๐˜๐—ฒ๐—ฑ ๐——๐—ถ๐—ฟ๐—ฒ๐—ฐ๐˜ ๐—ฃ๐—ฟ๐—ฒ๐—ณ๐—ฒ๐—ฟ๐—ฒ๐—ป๐—ฐ๐—ฒ ๐—ข๐—ฝ๐˜๐—ถ๐—บ๐—ถ๐˜‡๐—ฎ๐˜๐—ถ๐—ผ๐—ป" has been accepted to EMNLP 2025 (Main Track)! ๐ŸŽ‰

Large
Kai Han (@kaihan_vis) 's Twitter Profile Photo

Code released for Hyperbolic Category Discovery --- A simpleย hyperbolic framework for learning hierarchy-aware representations and classifiers for GCD, achieving the SOTA performance. ๐Ÿ’ป Code: github.com/Visual-AI/HypCD ๐ŸŒ Page: visual-ai.github.io/hypcd/

F. Gรผney (@ftm_guney) 's Twitter Profile Photo

visited Oxford for a couple of days for the robotics research groupโ€™s 40th anniversary (which VGG is part of) and its founder Mike Bradyโ€™s 80th birthday. it was nice to be back, walking in the University Parks and seeing dearly missed friends. some advice I heard there & loved:

visited Oxford for a couple of days for the robotics research groupโ€™s 40th anniversary (which VGG is part of) and its founder Mike Bradyโ€™s 80th birthday. it was nice to be back, walking in the University Parks and seeing dearly missed friends. some advice I heard there &amp; loved:
Kai Han (@kaihan_vis) 's Twitter Profile Photo

Introducing Inpaint4Drag, a novel drag-based editing framework, which is fast and effective.๐ŸŽจ๐Ÿš€๐Ÿ› ๏ธ ๐Ÿ“œPaper: arxiv.org/abs/2509.04582 ๐Ÿง‘โ€๐Ÿ’ปProject&code: visual-ai.github.io/inpaint4drag/ #ICCV2025

Yuki (@y_m_asano) 's Twitter Profile Photo

Our paper 'Selfโ€‘Labelling via Simultaneous Clustering and Representation Learning' just got its 1000th citation. On that occasion, I want to give my perspective on this question: Who or what is Sinkhornโ€“Knopp? Short answer: Itโ€™s the little ~1960s matrixโ€‘normalization workhorse

Saining Xie (@sainingxie) 's Twitter Profile Photo

three years ago, DiT replaced the legacy unet with a transformer-based denoising backbone. we knew the bulky VAEs would be the next to go -- we just waited until we could do it right. today, we introduce Representation Autoencoders (RAE). >> Retire VAEs. Use RAEs. ๐Ÿ‘‡(1/n)

three years ago, DiT replaced the legacy unet with a transformer-based denoising backbone. we knew the bulky VAEs would be the next to go -- we just waited until we could do it right.

today, we introduce Representation Autoencoders (RAE).

&gt;&gt; Retire VAEs. Use RAEs. ๐Ÿ‘‡(1/n)