
Michael Tschannen
@mtschannen
Research Scientist @GoogleDeepMind. Representation learning for multimodal understanding and generation. Personal account.
ID: 597511633
https://mitscha.github.io 02-06-2012 15:38:44
238 Tweet
2,2K Followers
658 Following








SigLIP2 is indeed a better encoder than SigLIP! Over the last two weekends, I trained a new SOTA multilingual model - mexma-siglip2. It has not only improved performance but also MIT license. Michael Tschannen Xiaohua Zhai Lucas Beyer (bl16) merve SkalskiP






