Benjamin Muller (@ben_mlr)'s Twitter Profile
Benjamin Muller

@ben_mlr

Research in AI. Focusing on scaling language models multi-modally & multilingually. Llama pretraining team @AIatMeta

ID: 722490639359709184

Link: http://benjamin-mlr.github.io · Joined: 19-04-2016 18:22:46

188 Tweets

924 Followers

1.1K Following

Armen Aghajanyan (@armenagha)'s Twitter Profile Photo

A restricted, safety-aligned (no-image-out) version of Chameleon (7B/34B) is now open-weight! github.com/facebookresear… The team strongly believes in open source. We had to do a lot of work to get this out to the public safely. Congrats to the Chameleon team!

Benjamin Muller (@ben_mlr)'s Twitter Profile Photo

It was great to present the Spirit-LM model with tuanh208. Spirit-LM is a foundation model that jointly learns text and expressive speech, based on Llama 2. Thanks to TwelveLabs (twelvelabs.io) for organizing the webinar. The arXiv paper is available here for more details: arxiv.org/abs/2402.05755
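
A minimal sketch of the word-level text/speech interleaving that a model like Spirit-LM can be trained on, assuming word-aligned discrete speech units (e.g. HuBERT-style ids); the modality markers, unit format, and mixing probability below are illustrative assumptions, not the released implementation:

```python
# Hedged sketch: build one mixed-modality token stream in which each aligned word is
# emitted either as text or as discrete speech units, so a single decoder-only LM sees
# both modalities in the same sequence. All names and markers here are assumptions.
import random

def interleave(words: list[str], speech_units: list[list[int]],
               p_speech: float = 0.5, seed: int = 0) -> list[str]:
    """Return one interleaved token stream from word-aligned text and speech units."""
    rng = random.Random(seed)
    stream = []
    for word, units in zip(words, speech_units):
        if rng.random() < p_speech:
            stream.append("[SPEECH]")
            stream.extend(f"<unit_{u}>" for u in units)   # toy stand-ins for speech-unit tokens
        else:
            stream.append("[TEXT]")
            stream.append(word)
    return stream

words = ["the", "cat", "sat"]
speech_units = [[12, 40], [7, 7, 91], [3]]                # toy unit ids aligned to each word
print(interleave(words, speech_units))
```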

Soumith Chintala (@soumithchintala)'s Twitter Profile Photo

I'm giving the opening Keynote at ICML 2024 on Tuesday the 23rd @ 9:30am CEST. I'll try to empower folks to get Open Science back on track -- the free discussion of ideas is such an important aspect of AI progress, and we've been losing track. This is a complex topic, and I won't

Laurens van der Maaten (@lvdmaaten)'s Twitter Profile Photo

So… we trained a model and we wrote a paper about it. Have fun y’all! llama.meta.com/llama-download… ai.meta.com/research/publi…

AI at Meta (@aiatmeta)'s Twitter Profile Photo

Starting today, open source is leading the way. Introducing Llama 3.1: Our most capable models yet. Today we’re releasing a collection of new Llama 3.1 models including our long-awaited 405B. These models deliver improved reasoning capabilities, a larger 128K token context

AI at Meta (@aiatmeta)'s Twitter Profile Photo

LLM Evaluations are an important area of work — today we're announcing a new LLM Evaluation Research Grant to foster further innovation in this area. Recipients will get $200K in funding to support this work. We're accepting proposals until September 6 ➡️ go.fb.me/eym3xq

Chunting Zhou (@violet_zct)'s Twitter Profile Photo

Introducing *Transfusion* - a unified approach for training models that can generate both text and images. arxiv.org/pdf/2408.11039 Transfusion combines language modeling (next token prediction) with diffusion to train a single transformer over mixed-modality sequences. This
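
A minimal sketch of the two-loss idea the tweet describes, assuming a toy transformer, a toy linear noising schedule, and an arbitrary loss weight; the module names, shapes, and the `TinyTransfusion` class are illustrative assumptions, not the paper's implementation:

```python
# Hedged sketch: one transformer trained with next-token prediction on text positions and
# a denoising (diffusion-style) loss on image-patch positions, over a single mixed sequence.
import torch
import torch.nn as nn
import torch.nn.functional as F

class TinyTransfusion(nn.Module):
    def __init__(self, vocab=1000, d=64, img_dim=16):
        super().__init__()
        self.text_emb = nn.Embedding(vocab, d)
        self.img_in = nn.Linear(img_dim, d)            # continuous image patches enter via a linear layer
        layer = nn.TransformerEncoderLayer(d, nhead=4, batch_first=True)
        self.backbone = nn.TransformerEncoder(layer, num_layers=2)
        self.text_head = nn.Linear(d, vocab)           # next-token prediction head
        self.img_head = nn.Linear(d, img_dim)          # noise-prediction head

    def forward(self, text_ids, noisy_patches):
        # One mixed-modality sequence: text embeddings followed by noised patch embeddings.
        # (Proper causal/bidirectional attention masking is omitted for brevity.)
        x = torch.cat([self.text_emb(text_ids), self.img_in(noisy_patches)], dim=1)
        h = self.backbone(x)
        n_text = text_ids.shape[1]
        return self.text_head(h[:, :n_text]), self.img_head(h[:, n_text:])

model = TinyTransfusion()
text_ids = torch.randint(0, 1000, (2, 8))              # toy text tokens
patches = torch.randn(2, 4, 16)                        # toy image-patch latents
noise = torch.randn_like(patches)
t = torch.rand(2, 1, 1)                                # random noise level per example
noisy = (1 - t) * patches + t * noise                  # toy linear noising schedule

logits, noise_pred = model(text_ids[:, :-1], noisy)
lm_loss = F.cross_entropy(logits.reshape(-1, 1000), text_ids[:, 1:].reshape(-1))
diffusion_loss = F.mse_loss(noise_pred, noise)         # predict the injected noise
loss = lm_loss + 5.0 * diffusion_loss                  # weighted sum; the weight here is arbitrary
loss.backward()
```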

Andrew Brown (@andrew__brown__)'s Twitter Profile Photo

OK, here goes the "excited to share ..." post. Want to know how to train a T2V model (with other amazing capabilities) that beats ALL prior work? Well, we released a 90-page tech report with every detail 😊 ai.meta.com/research/movie…… Thanks to the amazing team!

Benjamin Muller (@ben_mlr)'s Twitter Profile Photo

Recent LLMs (e.g. Llama 3 🦙) are increasingly good at math. However, this progress is reserved for languages with large amounts of task-specific instruct-tuning data. In this work at AI at Meta (led by Lucas Bandarkar), we introduce a new model merging technique called **Layer
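
The tweet is cut off before the technique's full name, but the idea it describes (composing a single model from the layers of two fine-tunes of the same base) can be sketched as below; the checkpoint naming scheme, layer indices, and the `merge_by_layer` helper are assumptions for illustration, not the paper's code:

```python
# Hedged sketch of layer-wise model merging: given a "math expert" and a "language expert"
# fine-tuned from the same base model, rebuild one model by choosing, layer by layer,
# which expert supplies the weights.
import torch

def merge_by_layer(math_sd: dict, lang_sd: dict, lang_layers: set[int]) -> dict:
    """Take the layers listed in `lang_layers` from the language expert, the rest from the math expert."""
    merged = {}
    for name in math_sd:
        from_lang = any(f"layers.{i}." in name for i in lang_layers)  # parameter naming is an assumption
        merged[name] = (lang_sd if from_lang else math_sd)[name].clone()
    return merged

# Toy state dicts standing in for the two fine-tuned checkpoints.
n_layers = 4
math_sd = {f"layers.{i}.weight": torch.zeros(2, 2) for i in range(n_layers)}
lang_sd = {f"layers.{i}.weight": torch.ones(2, 2) for i in range(n_layers)}

# e.g. take the outermost layers from the language expert and the middle from the math expert.
merged = merge_by_layer(math_sd, lang_sd, lang_layers={0, n_layers - 1})
print({k: int(v[0, 0].item()) for k, v in merged.items()})
```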

Xiang Yue@ICLR2025🇸🇬 (@xiangyue96)'s Twitter Profile Photo

🌍 I’ve always had a dream of making AI accessible to everyone, regardless of location or language. However, current open MLLMs often respond in English, even to non-English queries! 🚀 Introducing Pangea: A Fully Open Multilingual Multimodal LLM supporting 39 languages! 🌐✨

Benjamin Muller (@ben_mlr)'s Twitter Profile Photo

Groundbreaking scaling trends for Byte-level Language Modeling with the new BLT architecture 🚀 More insights in the thread 🧵

AI at Meta (@aiatmeta)'s Twitter Profile Photo

New from Meta FAIR — Byte Latent Transformer: Patches Scale Better Than Tokens introduces BLT, which, for the first time, matches tokenization-based LLM performance at scale with significant improvements in inference efficiency & robustness. Paper ➡️ go.fb.me/w23lmz

Gargi Ghosh (@gargighosh)'s Twitter Profile Photo

We released new research - Byte Latent Transformer (BLT). BLT encodes bytes into dynamic patches using lightweight local models and processes them with a large latent transformer. Think of it as a transformer sandwich!
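
A minimal sketch of the dynamic-patching step, assuming a unigram byte model as a stand-in for BLT's lightweight local model and an arbitrary surprisal threshold; the `dynamic_patches` helper below is illustrative, not the released code:

```python
# Hedged sketch of entropy-based dynamic patching: a new patch starts wherever the next
# byte is "surprising" under a small byte model (here, a toy unigram model), so predictable
# spans become long patches and surprising spans become short ones.
import math
from collections import Counter

def byte_surprisal(data: bytes) -> list[float]:
    """Per-byte negative log2-probability under a unigram model fit on the data itself."""
    counts = Counter(data)
    total = len(data)
    return [-math.log2(counts[b] / total) for b in data]

def dynamic_patches(data: bytes, threshold: float = 4.0) -> list[bytes]:
    """Group bytes into variable-length patches, opening a new patch at high-surprisal bytes."""
    patches, current = [], bytearray()
    for b, s in zip(data, byte_surprisal(data)):
        if current and s > threshold:
            patches.append(bytes(current))
            current = bytearray()
        current.append(b)
    if current:
        patches.append(bytes(current))
    return patches

# Each patch would then be pooled into one latent vector by the local encoder, processed by
# the large latent transformer, and decoded back to bytes by the local decoder.
print(dynamic_patches(b"aaaaaaaaaa BLT! aaaaaaaaaa"))
```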

Jason Weston (@jaseweston)'s Twitter Profile Photo

🚨 Diverse Preference Optimization (DivPO) 🚨 SOTA LLMs have model collapse🫠: they can't generate diverse creative writing or synthetic data 🎨 DivPO trains for both high reward & diversity, vastly improving variety with similar quality. Paper 📝: arxiv.org/abs/2501.18101 🧵below
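
A minimal sketch of DivPO-style pair selection, assuming a reward score and a diversity score are already computed for each sampled response; the quantile cutoff and the `select_divpo_pair` helper are illustrative stand-ins, not the paper's exact criterion:

```python
# Hedged sketch: from a pool of responses sampled for one prompt, pick the most diverse
# response that clears a reward bar as "chosen" and the least diverse low-reward response
# as "rejected"; the resulting pair would then be used for standard preference optimization.
from dataclasses import dataclass

@dataclass
class Response:
    text: str
    reward: float      # from a reward model (assumed given)
    diversity: float   # e.g. novelty relative to the other samples (assumed given)

def select_divpo_pair(samples: list[Response], reward_quantile: float = 0.5):
    """Return a (chosen, rejected) pair from one prompt's pool of sampled responses."""
    cutoff = sorted(r.reward for r in samples)[int(reward_quantile * (len(samples) - 1))]
    high = [r for r in samples if r.reward >= cutoff]
    low = [r for r in samples if r.reward < cutoff] or samples
    chosen = max(high, key=lambda r: r.diversity)    # high reward, maximally diverse
    rejected = min(low, key=lambda r: r.diversity)   # low reward, minimally diverse
    return chosen, rejected

pool = [
    Response("Once upon a time...", reward=0.90, diversity=0.10),
    Response("In a neon-lit bazaar on Ganymede...", reward=0.85, diversity=0.80),
    Response("Story story story.", reward=0.20, diversity=0.05),
]
chosen, rejected = select_divpo_pair(pool)
print("chosen:", chosen.text, "| rejected:", rejected.text)
```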

AI at Meta (@aiatmeta)'s Twitter Profile Photo

Today is the start of a new era of natively multimodal AI innovation. Today, we’re introducing the first Llama 4 models: Llama 4 Scout and Llama 4 Maverick — our most advanced models yet and the best in their class for multimodality. Llama 4 Scout • 17B-active-parameter model

Percy Liang (@percyliang)'s Twitter Profile Photo

We ran Llama 4 Maverick through some HELM benchmarks. It is 1st on HELM capabilities (MMLU-Pro, GPQA, IFEval, WildBench, Omni-MATH), but… crfm.stanford.edu/helm/capabilit…
