Mathieu (@mathieu_rita) 's Twitter Profile
Mathieu

@mathieu_rita

Research Scientist @AIatMeta ex: INRIA-MSR | @CoML_ENS | @Polytechnique Llama3 - RL fine-tuning - Emergent communication

ID: 320869425

linkhttps://mathieurita.github.io/ calendar_today20-06-2011 17:19:56

96 Tweet

216 Takipçi

268 Takip Edilen

CoML (Cognitive Machine Learning) | @ENS (@coml_ens) 's Twitter Profile Photo

[🚨Recruitment🗣️] CoML team is actively recruiting a Postdoctoral Fellow with expertise in machine learning, linguistics, or cognitive science. More information in the detailed announcement here : cognitive-ml.fr/docs/fiche_pos…

Baptiste Rozière (@b_roziere) 's Twitter Profile Photo

We released a 70B version of CodeLlama today! Trained on 1T tokens, it is a much stronger base model for coding tasks. I look forward to seeing what the community will do with it! :)

AK (@_akhaliq) 's Twitter Profile Photo

Meta presents SpiRit-LM Interleaved Spoken and Written Language Model paper page: huggingface.co/papers/2402.05… introduce SPIRIT-LM, a foundation multimodal language model that freely mixes text and speech. Our model is based on a pretrained text language model that we extend to the

Meta presents SpiRit-LM

Interleaved Spoken and Written Language Model

paper page: huggingface.co/papers/2402.05…

introduce SPIRIT-LM, a foundation multimodal language model that freely mixes text and speech. Our model is based on a pretrained text language model that we extend to the
fly51fly (@fly51fly) 's Twitter Profile Photo

[CL] Language Evolution with Deep Learning M Rita, P Michel, R Chaabouni, O Pietquin, E Dupoux, F Strub [INRIA & Google DeepMind] (2024) arxiv.org/abs/2403.11958 - Deep learning is well-suited for simulating communication games and studying language emergence and evolution. -

[CL] Language Evolution with Deep Learning
M Rita, P Michel, R Chaabouni, O Pietquin, E Dupoux, F Strub [INRIA & Google DeepMind] (2024)
arxiv.org/abs/2403.11958

- Deep learning is well-suited for simulating communication games and studying language emergence and evolution. 

-
AI at Meta (@aiatmeta) 's Twitter Profile Photo

Introducing Meta Llama 3: the most capable openly available LLM to date. Today we’re releasing 8B & 70B models that deliver on new capabilities such as improved reasoning and set a new state-of-the-art for models of their sizes. Today's release includes the first two Llama 3

lmarena.ai (formerly lmsys.org) (@lmarena_ai) 's Twitter Profile Photo

Exciting update -- Llama-3 full result is out, now reaching top-5 on the Arena leaderboard🔥 We've got stable enough CIs with over 12K votes. No question now Llama-3 70B is the new king of open model. Its powerful 8B variant has also surpassed many larger-size models. What an

Exciting update -- Llama-3 full result is out, now reaching top-5 on the Arena leaderboard🔥

We've got stable enough CIs with over 12K votes. No question now Llama-3 70B is the new king of open model. Its powerful 8B variant has also surpassed many larger-size models. What an
lmarena.ai (formerly lmsys.org) (@lmarena_ai) 's Twitter Profile Photo

Moreover, we observe even stronger performance in English category, where Llama 3 ranking jumps to ~1st place with GPT-4-Turbo! It consistently performs strong against top models (see win-rate matrix) by human preference. It's been optimized for dialogue scenario with large

Moreover, we observe even stronger performance in English category, where Llama 3 ranking jumps to ~1st place with GPT-4-Turbo!

It consistently performs strong against top models (see win-rate matrix) by human preference. It's been optimized for dialogue scenario with large
Thomas Scialom (@thomasscialom) 's Twitter Profile Photo

We had a small party to celebrate Llama-3 yesterday in Paris! The entire LLM OSS community joined us with Hugging Face, kyutai, Google DeepMind (Gemma), cohere As someone said: better that the building remains safe, or ciao the open source for AI 😆

We had a small party to celebrate Llama-3 yesterday in Paris! The entire LLM OSS community  joined us with <a href="/huggingface/">Hugging Face</a>, <a href="/kyutai_labs/">kyutai</a>, <a href="/GoogleDeepMind/">Google DeepMind</a> (Gemma), <a href="/cohere/">cohere</a>
As someone said: better that the building remains safe, or  ciao the open source for AI 😆
lmarena.ai (formerly lmsys.org) (@lmarena_ai) 's Twitter Profile Photo

Exciting new blog -- What’s up with Llama-3? Since Llama 3’s release, it has quickly jumped to top of the leaderboard. We dive into our data and answer below questions: - What are users asking? When do users prefer Llama 3? - How challenging are the prompts? - Are certain users

Exciting new blog -- What’s up with Llama-3?

Since Llama 3’s release, it has quickly jumped to top of the leaderboard. We dive into our data and answer below questions:

- What are users asking? When do users prefer Llama 3?
- How challenging are the prompts?
- Are certain users
Yann LeCun (@ylecun) 's Twitter Profile Photo

💥BOOM 💥 Llama 3.1 is out 💥 405B, 70B, 8B versions. Main takeaways: 1. 405B performance is on par with the best closed models. 2. Open/free weights and code, with a license that enables fine-tuning, distillation into other models, and deployment anywhere. 3. 128k context

Jérémie Kalfon (@jkobject) 's Twitter Profile Photo

This allows scPRINT zero-shot abilities -meaning no fine-tuning required- such as artificially increasing the depth of the expression profile of a cell (denoising / zero imputation), predicting the cell type, disease, sequencer, and sex of a cell, as well as creating cell

This allows scPRINT zero-shot abilities -meaning no fine-tuning required- such as artificially increasing the depth of the expression profile of a cell (denoising / zero imputation), predicting the cell type, disease, sequencer, and sex of a cell, as well as creating cell
Language Gamification Workshop @ NeurIPS 2024 (@mllanguagegames) 's Twitter Profile Photo

🔔🚨 [ALERT] Calls for papers! 🚨🔔 Language Gamification Workshop @ NeurIPS 2024 openreview.net/group?id=NeurI… 🤔 Topics: In-Context Learning, Deep Reinforcement Learning, Modern NLP, Multi-Agent Learning, Language Emergence, Embodiment, Cognitive Science... ⏰ Deadline: August 30

Rui Hou (@magpie_rayhou) 's Twitter Profile Photo

Our team, Llama Post-training, is looking to hire 2025 PhD Research Interns to join us at Meta GenAI. If you are interested in working on RL for LLM, Code Generation, Reasoning, and Agents with us, drop me a message with your CV. Link: metacareers.com/jobs/106355302…

CoML (Cognitive Machine Learning) | @ENS (@coml_ens) 's Twitter Profile Photo

🚀 Exciting Post-Doc Opportunity! 🚀 Join the CoML team for the new ERC project InfantSimulator ! If you're passionate about language modeling & machine learning, apply now ! École normale supérieure | PSL 📍 Paris 🔗 cognitive-ml.fr/docs/Fiche_pos… #PostDoc #CognitiveScience #LanguageModeling #Job #AI

Roberta Raileanu (@robertarail) 's Twitter Profile Photo

I’m looking for a PhD intern for next year to work at the intersection of LLM-based agents and open-ended learning, part of the Llama Research Team in London. If interested please send me an email with a short paragraph with some research ideas and apply at the link below.

Grégoire Mialon (@mialon_gregoire) 's Twitter Profile Photo

I am hiring an intern in our Llama team for 2025! Near the end of PhD completion, willing to be based out of Paris. You will succeed Dheeraj Mekala, work around frontier LLMs, tool use, agents, and more :) Please apply here: metacareers.com/jobs/109555634…

Dr. Limor Raviv 🦄🤗🐘🦒 (@limor_raviv) 's Twitter Profile Photo

New paper w/Lukas Galke! We identify several key pressures for language learning and emergence by reviewing 3 mismatches between how humans👥 and deep neural networks🤖(LLMs & emergent communication agents) behave when learning to communicate from scratch: ldr.lps.library.cmu.edu/article/id/748/

Paul Michel (@pmichelx) 's Twitter Profile Photo

Interested in working on Gemini pre-training? I'm hiring a research scientist to work on pre-training data Google DeepMind in London: boards.greenhouse.io/deepmind/jobs/… I am unfortunately not at #NeurIPS2024 but feel free to reach out to ask questions or see the team at the booth there!

Language Gamification Workshop @ NeurIPS 2024 (@mllanguagegames) 's Twitter Profile Photo

🥳 Recording of our workshop is now publicly available at neurips.cc/virtual/2024/w…! We highly recommend the panel discussion, especially the debate on inductive bias for learning 😆