
Xihui Liu
@xihuiliu
Assistant Professor @ HKU. Previous Postdoc @ UC Berkeley and PhD @ CUHK MMLab
ID: 3329738772
https://xh-liu.github.io/ 24-08-2015 20:11:40
41 Tweet
1,1K Takipçi
198 Takip Edilen







Why are discrete visual tokenizers difficult to scale? In GigaTok, we study the key factors for scaling tokenizers, and scale VQ tokenizers to 3B for better reconstruction, AR generation, and representation. Code and models released huggingface.co/papers/2504.08… silentview.github.io/GigaTok/







