Adrien Bardes (@adrienbardes)'s Twitter Profile
Adrien Bardes

@adrienbardes

Research Scientist @AIatMeta. Self-supervised learning, Video understanding, Visual world modelling. PhD @AIatMeta & @Inria.

ID: 787639668

Link: http://adrien987k.github.io · Joined: 28-08-2012 19:01:18

56 Tweets

804 Followers

234 Following

AK (@_akhaliq):

Meta presents Learning and Leveraging World Models in Visual Representation Learning

Joint-Embedding Predictive Architecture (JEPA) has emerged as a promising self-supervised approach that learns by leveraging a world model. While previously limited to predicting missing parts
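
The JEPA idea described above amounts to predicting in latent space rather than pixel space. Below is a minimal, hedged sketch of such an objective; the module names (context_encoder, target_encoder, predictor) and the smooth-L1 loss are illustrative assumptions, not the paper's actual implementation.

```python
# Minimal JEPA-style objective sketch (illustrative, not the paper's code):
# encode a visible "context" view, encode a masked/future "target" view with a
# no-gradient (often EMA) encoder, and regress predicted latents onto target
# latents instead of reconstructing pixels.
import torch
import torch.nn.functional as F

def jepa_loss(context_encoder, target_encoder, predictor, context, target):
    z_context = context_encoder(context)        # online encoder on the visible view
    with torch.no_grad():                       # target encoder receives no gradients
        z_target = target_encoder(target)
    z_pred = predictor(z_context)               # "world model": predict target latents
    return F.smooth_l1_loss(z_pred, z_target)   # regression in embedding space
```
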
AI at Meta (@aiatmeta):

Introducing Meta Llama 3: the most capable openly available LLM to date. Today we’re releasing 8B & 70B models that deliver on new capabilities such as improved reasoning and set a new state-of-the-art for models of their sizes. Today's release includes the first two Llama 3

AI at Meta (@aiatmeta):

New research from FAIR: Better & Faster Large Language Models via Multi-token Prediction

Research paper ➡️ go.fb.me/wty7gj

We show that replacing next token prediction tasks with multiple token prediction can result in substantially better code generation performance
Badr Youbi Idrissi (@byoubii):

What happens if we make language models predict several tokens ahead instead of only the next one? In this paper, we show that multi-token prediction boosts language model training efficiency. 🧵 1/11
Paper: arxiv.org/abs/2404.19737 
Joint work with Fabian Gloeckle
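
As a rough illustration of the idea in this thread, here is a hedged sketch of a multi-token prediction loss: a shared trunk feeds several output heads, and head i is trained to predict the token i positions ahead. The names, shapes, and per-head averaging are assumptions for illustration, not the setup from arxiv.org/abs/2404.19737.

```python
# Sketch of multi-token prediction: n_future linear heads over shared trunk
# features, where head i predicts the token i positions ahead (illustrative only).
import torch
import torch.nn as nn
import torch.nn.functional as F

class MultiTokenHeads(nn.Module):
    def __init__(self, d_model: int, vocab_size: int, n_future: int = 4):
        super().__init__()
        self.n_future = n_future
        self.heads = nn.ModuleList(
            [nn.Linear(d_model, vocab_size) for _ in range(n_future)]
        )

    def loss(self, hidden: torch.Tensor, tokens: torch.Tensor) -> torch.Tensor:
        """hidden: (batch, seq, d_model) trunk outputs; tokens: (batch, seq) token ids."""
        total = 0.0
        for i, head in enumerate(self.heads, start=1):
            logits = head(hidden[:, :-i])        # position t predicts token t + i
            targets = tokens[:, i:]
            total = total + F.cross_entropy(
                logits.reshape(-1, logits.size(-1)), targets.reshape(-1)
            )
        return total / self.n_future
```
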
AI at Meta (@aiatmeta):

📝 New from FAIR: An Introduction to Vision-Language Modeling.

Vision-language models (VLMs) are an area of research that holds a lot of potential to change our interactions with technology; however, there are many challenges in building these types of models. Together with a set
Pietro Astolfi (@piovrasca):

Are sota image generative models effective world models?

Consistency-diversity-realism Pareto fronts show they're not (yet):
- No model dominates others as a world model
- Improvements in quality and consistency have come at the expense of diversity

🔗 arxiv.org/abs/2406.10429
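
For context on "no model dominates others": a Pareto front over (consistency, diversity, realism) keeps exactly the models that are not beaten on all three axes at once. Below is a hedged, self-contained sketch of that selection; the metric tuples are placeholders, not numbers from the paper.

```python
# Pareto-front selection over per-model (consistency, diversity, realism) scores,
# higher is better on every axis. All scores below are illustrative placeholders.
def pareto_front(scores: dict) -> list:
    front = []
    for name, s in scores.items():
        dominated = any(
            all(o >= v for o, v in zip(other, s))
            and any(o > v for o, v in zip(other, s))
            for other_name, other in scores.items()
            if other_name != name
        )
        if not dominated:
            front.append(name)
    return front

# Example usage with made-up numbers:
print(pareto_front({
    "model_a": (0.8, 0.5, 0.9),
    "model_b": (0.7, 0.7, 0.8),
    "model_c": (0.6, 0.4, 0.7),
}))
# -> ['model_a', 'model_b']  (model_c is dominated by model_b)
```
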
AI at Meta (@aiatmeta):

Starting today, open source is leading the way. Introducing Llama 3.1: Our most capable models yet. Today we’re releasing a collection of new Llama 3.1 models including our long awaited 405B. These models deliver improved reasoning capabilities, a larger 128K token context

AI at Meta (@aiatmeta):

Introducing Meta Segment Anything Model 2 (SAM 2) — the first unified model for real-time, promptable object segmentation in images & videos. SAM 2 is available today under Apache 2.0 so that anyone can use it to build their own experiences Details ➡️ go.fb.me/p749s5

Adrien Bardes (@adrienbardes):

Job alert 🚨 My team at AI at Meta is looking for a PhD intern to join us in 2025 in Paris. We are working on self-supervised learning from video, world modelling and JEPA! Apply here or reach out directly: metacareers.com/jobs/168411027…

TimDarcet (@timdarcet):

Want strong SSL, but not the complexity of DINOv2?

CAPI: Cluster and Predict Latent Patches for Improved Masked Image Modeling.
Quentin Garrido (@garridoq_):

The last paper of my PhD is finally out! Introducing
"Intuitive physics understanding emerges from self-supervised pretraining on natural videos"

We show that without any prior, V-JEPA --a self-supervised video model-- develops an understanding of intuitive physics!
Pierre Chambon (@pierrechambon6):

Does your LLM truly comprehend the complexity of the code it generates? 🥰
 
Introducing our new non-saturated (for at least the coming week? 😉) benchmark:
 
✨BigO(Bench)✨ - Can LLMs Generate Code with Controlled Time and Space Complexity?
 
Check out the details below! 👇
AI at Meta (@aiatmeta):

Our vision is for AI that uses world models to adapt in new and dynamic environments and efficiently learn new skills. We’re sharing V-JEPA 2, a new world model with state-of-the-art performance in visual understanding and prediction. V-JEPA 2 is a 1.2 billion-parameter model,