Thomas Scialom (@thomasscialom) 's Twitter Profile
Thomas Scialom

@thomasscialom

AGI Researcher @MetaAI -- I led Llama 2 and Postraining Llama 3. Also CodeLlama, Galactica, Toolformer, Bloom, Nougat, GAIA, ..

ID: 942694791707545600

linkhttps://www.linkedin.com/in/tscialom/ calendar_today18-12-2017 09:55:27

1,1K Tweet

7,7K Followers

209 Following

Antoine Moyroud 🧑‍🚀🇫🇷 (@antoine_moyroud) 's Twitter Profile Photo

We're starting things off in #Paris, the rising European AI epicenter, in a few weeks time with a stellar duo. 🇫🇷 Thomas Scialom and Guillaume Lample @ ICLR 2024 will be joining me to discuss the latest advancements in the field of AI and specifically #LLMs

Mark Riedl (@mark_riedl) 's Twitter Profile Photo

I propose that industry AI impact be measured in “Llamas”, a new metric that I just made up. Either this reinforces the narrative that industry is an exciting place to be doing AI, or is incredibly depressing because everyone is chasing the same ideas around in circles.

Yann LeCun (@ylecun) 's Twitter Profile Photo

AI systems are fast becoming a basic infrastructure. Historically, basic infrastructure always ends up being open source (think of the software infra of the internet, including Linux, Apache, JavaScript and browser engines, etc) It's the only way to make it reliable, secure, and

Thomas Scialom (@thomasscialom) 's Twitter Profile Photo

In fact there is on perplexity demo a specific system prompt that amplifes over safe responses. It has been removed from other demos like HF. Perplexity Denis Yarats could we deactivate it as well by default please?

Thomas Scialom (@thomasscialom) 's Twitter Profile Photo

It did in fact. RLHF is the technology behind chatgpt and probably dalle3. To panned out on real-world problems it needed nothing more than human feedback rewards.

Thomas Scialom (@thomasscialom) 's Twitter Profile Photo

I strongly disagree. There are many paths to success, and doing a PhD is never a suboptimal choice. Both professionally and personally.

Thomas Scialom (@thomasscialom) 's Twitter Profile Photo

At the AI-pulse today I talked about -- surprise -- LLMs. There short history, a deep dive into Llama 2, the magic behind RLHF, and my vision of where of the future of the field. Thanks Scaleway for the opportunity!

At the AI-pulse today I talked about -- surprise -- LLMs. There short history, a deep dive into Llama 2, the magic behind RLHF, and my vision of where of the future of the field.
Thanks <a href="/Scaleway/">Scaleway</a> for the opportunity!
AK (@_akhaliq) 's Twitter Profile Photo

GAIA: a benchmark for General AI Assistants paper page: huggingface.co/papers/2311.12… introduce GAIA, a benchmark for General AI Assistants that, if solved, would represent a milestone in AI research. GAIA proposes real-world questions that require a set of fundamental abilities such

GAIA: a benchmark for General AI Assistants

paper page: huggingface.co/papers/2311.12…

introduce GAIA, a benchmark for General AI Assistants that, if solved, would represent a milestone in AI research. GAIA proposes real-world questions that require a set of fundamental abilities such
Thomas Scialom (@thomasscialom) 's Twitter Profile Photo

Despite being an amazing paper, chinchilla did/could not be open-source. Llama-1 has now more than 10x citations than Chinchilla.

Thomas Scialom (@thomasscialom) 's Twitter Profile Photo

Delighted to finally introduce Llama 3: The most capable openly available LLM to date. Long jouney since Llama-2, a big shoutout to the incredible team effort that made this possible, and stay tuned, we will keep building🦙 ai.meta.com/blog/meta-llam…

Delighted to finally introduce Llama 3: The most capable openly available LLM to date. Long jouney since Llama-2, a big shoutout to the incredible team effort that made this possible, and stay tuned, we will keep building🦙
ai.meta.com/blog/meta-llam…
Thomas Scialom (@thomasscialom) 's Twitter Profile Photo

We had a small party to celebrate Llama-3 yesterday in Paris! The entire LLM OSS community joined us with Hugging Face, kyutai, Google DeepMind (Gemma), cohere As someone said: better that the building remains safe, or ciao the open source for AI 😆

We had a small party to celebrate Llama-3 yesterday in Paris! The entire LLM OSS community  joined us with <a href="/huggingface/">Hugging Face</a>, <a href="/kyutai_labs/">kyutai</a>, <a href="/GoogleDeepMind/">Google DeepMind</a> (Gemma), <a href="/cohere/">cohere</a>
As someone said: better that the building remains safe, or  ciao the open source for AI 😆
Thomas Scialom (@thomasscialom) 's Twitter Profile Photo

I am at ICLR.. 🦙 Llama-3: I ll be every morning at 11am at the AI at Meta for Llama-3 QA sessions 🤖 GAIA: General AI Assistant benchmark w/ Gregoire 🔭 NOUGAT: for Scientific OCR w/ Lukas And if you are interested in post-training, rlhf, agents i m down for ☕&🍺 ICLR 2025

Thomas Scialom (@thomasscialom) 's Twitter Profile Photo

The team worked really hard to make history, voila finally the Llama-3.1 herd of models...have fun with it! * open 405B, insane 70B * 128K context length, improved reasoning & coding capabilities * detailed paper ai.meta.com/research/publi…

The team worked really hard to make history, voila finally the Llama-3.1 herd of models...have fun with it!
  * open 405B, insane 70B
  * 128K context length, improved reasoning &amp; coding capabilities
  * detailed paper ai.meta.com/research/publi…
Latent.Space (@latentspacepod) 's Twitter Profile Photo

🆕 pod with Thomas Scialom of AI at Meta! Llama 2, 3 & 4: Synthetic Data, RLHF, Agents on the path to Open Source AGI latent.space/p/llama-3 shoutouts: - Why Yann LeCun's Galactica Instruct would have solved Lucas Beyer (bl16)'s Citations Generator - Beyond Chinchilla-Optimal: 100x

Yifei Hu (@hu_yifei) 's Twitter Profile Photo

My humble opinion on the "ACL is not an AI Conference" discussion: Statements like "Language Models are stochastic parrots", "arXiv is a cancer", and "ACL is not XXX" (btw, all came from the same well-known person) are completely personal and subjective. None of these discussions