João Maria Janeiro (@joaomjaneiro) 's Twitter Profile
João Maria Janeiro

@joaomjaneiro

PhD Student @ Meta AI Paris & Sorbonne University | 🇵🇹

ID: 1588861604397027330

calendar_today05-11-2022 11:51:52

27 Tweet

67 Takipçi

134 Takip Edilen

TimDarcet (@timdarcet) 's Twitter Profile Photo

🚨 RELEASE ALERT ‼️ github.com/facebookresear… THIS CHANGES EVERYTHING $META just dropped a game-changing codebase! Now everyone can do LLM research! 😱 🧵10 best things people are already building with lingua 🔥👇

Adrien Bardes (@adrienbardes) 's Twitter Profile Photo

Job alert 🚨 My team AI at Meta is looking for a PhD intern to join us in 2025 in Paris. We are working on self-supervised learning from video, world modelling and JEPA ! Apply here or reach out directly: metacareers.com/jobs/168411027…

Mathurin Videau (@mathuvu_) 's Twitter Profile Photo

Meta Lingua: a minimal, fast LLM codebase for training and inference. By researchers, for researchers. Easily hackable, still reproducible. Built-in efficiency, profiling (cpu, gpu and mem) and interpretability (automatic activation and gradient statistics) Joint work w/ Badr Youbi Idrissi

Tom Sander @NeurIPS (@rednastom) 's Twitter Profile Photo

🔒Image watermarking is promising for digital content protection. But images often undergo many modifications—spliced or altered by AI. Today at AI at Meta, we released Watermark Anything that answers not only "where does the image come from," but "what part comes from where." 🧵

🔒Image watermarking is promising  for digital content protection. But images often undergo many modifications—spliced or altered by AI. Today at <a href="/AIatMeta/">AI at Meta</a>, we released Watermark Anything that answers not only "where does the image come from," but "what part comes from where." 🧵
João Maria Janeiro (@joaomjaneiro) 's Twitter Profile Photo

It was great being a part of this large project with so many amazing people! Reinventing the way text generation models work, moving away from the traditional token paradigm of LLMs. Check it out! Paper: arxiv.org/abs/2412.08821

Belen Alastruey (@b_alastruey) 's Twitter Profile Photo

Happy to share our team's work on Large Concept Models (LCMs), a new approach for language modeling that goes beyond standard token-based LLMs by operating in a multilingual and multimodal embedding space. Check out the full paper! 📄: ai.meta.com/research/publi…

Happy to share our team's work on Large Concept Models (LCMs), a new approach for language modeling that goes beyond standard token-based LLMs by operating in a multilingual and multimodal embedding space.  Check out the full paper!

📄:  ai.meta.com/research/publi…
João Maria Janeiro (@joaomjaneiro) 's Twitter Profile Photo

Amazing new works on audio generation from AI at Meta , specifically stem-level music edition through text and audio language modelling trained on watermarked data with easy detection!

Piotr Bojanowski (@p_bojanowski) 's Twitter Profile Photo

🔥 The DINO team is looking for a PostDoc! 🔥 If you are about to graduate, and want to be part of what’s next for SSL, don’t hesitate to reach out! Link to job offer : metacareers.com/jobs/502476149…

João Maria Janeiro (@joaomjaneiro) 's Twitter Profile Photo

Another great paper by TimDarcet et al (AI at Meta). They explore how to effectively make masked image modeling work, via thorough exploration on all components of the pipeline, from masking, to the architecture, loss, targets to predict... Check it out! Code is also accessible!

João Maria Janeiro (@joaomjaneiro) 's Twitter Profile Photo

Another banger from Quentin Garrido et al from AI at Meta. They explore how JEPA models (models predicting in latent space) have a better understanding of intuitive physics! Check it out:

Wassim (Wes) Bouaziz (@_vassim) 's Twitter Profile Photo

Want to know if a ML model was trained on your dataset with 1 API call? See you in conferences 🙌 Excited to share that our paper Data Taggants for image data was accepted at ICLR 2025 🎉 Our follow-up on audio data, was accepted at ICASSP 2025! 🎉 Check out the details below 👇

Want to know if a ML model was trained on your dataset with 1 API call? See you in conferences 🙌

Excited to share that our paper Data Taggants for image data was accepted at ICLR 2025 🎉
Our follow-up on audio data, was accepted at ICASSP 2025! 🎉
Check out the details below 👇
João Maria Janeiro (@joaomjaneiro) 's Twitter Profile Photo

Want to know if your LLM can understand code well? Check out this new paper by Pierre Chambon from AI at Meta! It is a complex and non saturated benchmark that will surely put LLMs to the test on their understanding of code!

João Maria Janeiro (@joaomjaneiro) 's Twitter Profile Photo

Are you struggling to extract relevant features from your data? Check out this new work from Krunoslav Lehman Pavasovic from AI at Meta, where they propose a new training objective to learn more relevant features!

Kunhao Zheng @ ICLR 2025 (@kunhaoz) 's Twitter Profile Photo

🚨 Your RL only improves 𝗽𝗮𝘀𝘀@𝟭, not 𝗽𝗮𝘀𝘀@𝗸? 🚨 That’s not a bug — it’s a 𝗳𝗲𝗮𝘁𝘂𝗿𝗲 𝗼𝗳 𝘁𝗵𝗲 𝗼𝗯𝗷𝗲𝗰𝘁𝗶𝘃𝗲 you’re optimizing. You get what you optimize for. If you want better pass@k, you need to optimize for pass@k at training time. 🧵 How?

🚨 Your RL only improves 𝗽𝗮𝘀𝘀@𝟭, not 𝗽𝗮𝘀𝘀@𝗸? 🚨

That’s not a bug — it’s a 𝗳𝗲𝗮𝘁𝘂𝗿𝗲 𝗼𝗳 𝘁𝗵𝗲 𝗼𝗯𝗷𝗲𝗰𝘁𝗶𝘃𝗲 you’re optimizing.

You get what you optimize for. If you want better pass@k, you need to optimize for pass@k at training time.

🧵 How?
João Maria Janeiro (@joaomjaneiro) 's Twitter Profile Photo

Are you struggling to improve the performance of your multilingual model? The reason might be because of the languages you are mixing! But how can you know what languages to mix to maximize performance? Check our paper!