
Wassim (Wes) Bouaziz
@_vassim
PhD student @MetaAI and @Polytechnique
Previously @ENS_ULM @ENS_ParisSaclay
I confront equations and inequalities💡
ID: 227737847
https://wesbz.github.io/ 17-12-2010 17:29:29
3,3K Tweet
627 Followers
2,2K Following

We present an Autoregressive U-Net that incorporates tokenization inside the model, pooling raw bytes into words then word-groups. AU-Net focuses most of its compute on building latent vectors that correspond to larger units of meaning. Joint work with Badr Youbi Idrissi 1/8


Here is the recording with the slides for those interested! 🎤 youtu.be/UONvP1TL0-g?fe… 📊drive.google.com/file/d/14ZIopS… 📑arxiv.org/pdf/2410.02724 Cohere Labs Cohere Labs




Why does Meta open-source its models? I talked about it with Maciej Kawecki - This Is IT looking at Dino, our computer vision model with applications in forest mapping, medical research, agriculture and more. Open-source boosts AI access, transparency, and safety. youtube.com/watch?v=eNGafi…

heading to ICML Conference #ICML2025 next week! come say hi & i'd love to learn about your work :) i'll present this paper (arxiv.org/abs/2503.17514) on the pitfalls of training set inclusion in LLMs, Thursday 11am here are my talk slides to flip through: ai.stanford.edu/~kzliu/files/m…


🚀 We are happy to organize the BERT²S workshop NeurIPS Conference 2025 on Recent Advances in Time Series Foundation Models. 🌐 berts-workshop.github.io 📜Submit by August 22 🎓Speakers and panelists: Chenghao Liu Mingsheng Long Zoe Piran Danielle Maddix Ameet Talwalkar Qingsong Wen, PhD, Head of AI @ Squirrel AI



🚀New paper alert! 🚀 In our work AI at Meta we dive into the struggles of mixing languages in largely multilingual Transformer encoders and use the analysis as a tool to better design multilingual models to obtain optimal performance. 📄: arxiv.org/abs/2508.02256 🧵(1/n)



