Mathieu (@mathieu_rita) Twitter Tweets • TwiCopy

CoML (Cognitive Machine Learning) | @ENS

2 years ago

[🚨Recruitment🗣️] CoML team is actively recruiting a Postdoctoral Fellow with expertise in machine learning, linguistics, or cognitive science. More information in the detailed announcement here : cognitive-ml.fr/docs/fiche_pos…

thumb_up_off_alt20

chat_bubble_outline0

repeat13

shareShare

Baptiste Rozière

@b_roziere

2 years ago

We released a 70B version of CodeLlama today! Trained on 1T tokens, it is a much stronger base model for coding tasks. I look forward to seeing what the community will do with it! :)

thumb_up_off_alt145

chat_bubble_outline3

repeat31

shareShare

AK

@_akhaliq

2 years ago

Meta presents SpiRit-LM Interleaved Spoken and Written Language Model paper page: huggingface.co/papers/2402.05… introduce SPIRIT-LM, a foundation multimodal language model that freely mixes text and speech. Our model is based on a pretrained text language model that we extend to the

thumb_up_off_alt122

chat_bubble_outline1

repeat27

shareShare

fly51fly

@fly51fly

2 years ago

[CL] Language Evolution with Deep Learning M Rita, P Michel, R Chaabouni, O Pietquin, E Dupoux, F Strub [INRIA & Google DeepMind] (2024) arxiv.org/abs/2403.11958 - Deep learning is well-suited for simulating communication games and studying language emergence and evolution. -

thumb_up_off_alt16

chat_bubble_outline0

repeat9

shareShare

AI at Meta

@aiatmeta

2 years ago

Introducing Meta Llama 3: the most capable openly available LLM to date. Today we’re releasing 8B & 70B models that deliver on new capabilities such as improved reasoning and set a new state-of-the-art for models of their sizes. Today's release includes the first two Llama 3

thumb_up_off_alt5,5K

chat_bubble_outline344

repeat1,1K

shareShare

Rui Hou

@magpie_rayhou

2 years ago

Excited to release a preview version of Llama3 with superb performance to the community! More to come soon!

thumb_up_off_alt31

chat_bubble_outline2

repeat4

shareShare

lmarena.ai (formerly lmsys.org)

@lmarena_ai

2 years ago

Exciting update -- Llama-3 full result is out, now reaching top-5 on the Arena leaderboard🔥 We've got stable enough CIs with over 12K votes. No question now Llama-3 70B is the new king of open model. Its powerful 8B variant has also surpassed many larger-size models. What an

thumb_up_off_alt1,1K

chat_bubble_outline30

repeat157

shareShare

lmarena.ai (formerly lmsys.org)

@lmarena_ai

2 years ago

Moreover, we observe even stronger performance in English category, where Llama 3 ranking jumps to ~1st place with GPT-4-Turbo! It consistently performs strong against top models (see win-rate matrix) by human preference. It's been optimized for dialogue scenario with large

thumb_up_off_alt381

chat_bubble_outline11

repeat41

shareShare

Thomas Scialom

@thomasscialom

2 years ago

We had a small party to celebrate Llama-3 yesterday in Paris! The entire LLM OSS community joined us with Hugging Face, kyutai, Google DeepMind (Gemma), cohere As someone said: better that the building remains safe, or ciao the open source for AI 😆

We had a small party to celebrate Llama-3 yesterday in Paris! The entire LLM OSS community joined us with <a href="/huggingface/">Hugging Face</a>, <a href="/kyutai_labs/">kyutai</a>, <a href="/GoogleDeepMind/">Google DeepMind</a> (Gemma), <a href="/cohere/">cohere</a>
As someone said: better that the building remains safe, or ciao the open source for AI 😆

thumb_up_off_alt232

chat_bubble_outline14

repeat9

shareShare

lmarena.ai (formerly lmsys.org)

@lmarena_ai

2 years ago

Exciting new blog -- What’s up with Llama-3? Since Llama 3’s release, it has quickly jumped to top of the leaderboard. We dive into our data and answer below questions: - What are users asking? When do users prefer Llama 3? - How challenging are the prompts? - Are certain users

thumb_up_off_alt723

chat_bubble_outline14

repeat116

shareShare

Yann LeCun

@ylecun

a year ago

💥BOOM 💥 Llama 3.1 is out 💥 405B, 70B, 8B versions. Main takeaways: 1. 405B performance is on par with the best closed models. 2. Open/free weights and code, with a license that enables fine-tuning, distillation into other models, and deployment anywhere. 3. 128k context

thumb_up_off_alt6,6K

chat_bubble_outline229

repeat946

shareShare

Jérémie Kalfon

@jkobject

a year ago

This allows scPRINT zero-shot abilities -meaning no fine-tuning required- such as artificially increasing the depth of the expression profile of a cell (denoising / zero imputation), predicting the cell type, disease, sequencer, and sex of a cell, as well as creating cell

thumb_up_off_alt10

chat_bubble_outline1

repeat2

shareShare

Language Gamification Workshop @ NeurIPS 2024

@mllanguagegames

a year ago

🔔🚨 [ALERT] Calls for papers! 🚨🔔 Language Gamification Workshop @ NeurIPS 2024 openreview.net/group?id=NeurI… 🤔 Topics: In-Context Learning, Deep Reinforcement Learning, Modern NLP, Multi-Agent Learning, Language Emergence, Embodiment, Cognitive Science... ⏰ Deadline: August 30

thumb_up_off_alt16

chat_bubble_outline3

repeat7

shareShare

Rui Hou

@magpie_rayhou

a year ago

Our team, Llama Post-training, is looking to hire 2025 PhD Research Interns to join us at Meta GenAI. If you are interested in working on RL for LLM, Code Generation, Reasoning, and Agents with us, drop me a message with your CV. Link: metacareers.com/jobs/106355302…

thumb_up_off_alt10

chat_bubble_outline0

repeat2

shareShare

CoML (Cognitive Machine Learning) | @ENS

@coml_ens

a year ago

🚀 Exciting Post-Doc Opportunity! 🚀 Join the CoML team for the new ERC project InfantSimulator ! If you're passionate about language modeling & machine learning, apply now ! École normale supérieure | PSL 📍 Paris 🔗 cognitive-ml.fr/docs/Fiche_pos… #PostDoc #CognitiveScience #LanguageModeling #Job #AI

thumb_up_off_alt8

chat_bubble_outline0

repeat6

shareShare

Roberta Raileanu

@robertarail

a year ago

I’m looking for a PhD intern for next year to work at the intersection of LLM-based agents and open-ended learning, part of the Llama Research Team in London. If interested please send me an email with a short paragraph with some research ideas and apply at the link below.

thumb_up_off_alt573

chat_bubble_outline11

repeat104

shareShare

Grégoire Mialon

@mialon_gregoire

a year ago

I am hiring an intern in our Llama team for 2025! Near the end of PhD completion, willing to be based out of Paris. You will succeed Dheeraj Mekala, work around frontier LLMs, tool use, agents, and more :) Please apply here: metacareers.com/jobs/109555634…

thumb_up_off_alt302

chat_bubble_outline5

repeat41

shareShare

Dr. Limor Raviv 🦄🤗🐘🦒

@limor_raviv

a year ago

New paper w/Lukas Galke! We identify several key pressures for language learning and emergence by reviewing 3 mismatches between how humans👥 and deep neural networks🤖(LLMs & emergent communication agents) behave when learning to communicate from scratch: ldr.lps.library.cmu.edu/article/id/748/

thumb_up_off_alt17

chat_bubble_outline1

repeat2

shareShare

Paul Michel

@pmichelx

a year ago

Interested in working on Gemini pre-training? I'm hiring a research scientist to work on pre-training data Google DeepMind in London: boards.greenhouse.io/deepmind/jobs/… I am unfortunately not at #NeurIPS2024 but feel free to reach out to ask questions or see the team at the booth there!

thumb_up_off_alt162

chat_bubble_outline2

repeat31

shareShare

Language Gamification Workshop @ NeurIPS 2024

@mllanguagegames

10 months ago

🥳 Recording of our workshop is now publicly available at neurips.cc/virtual/2024/w…! We highly recommend the panel discussion, especially the debate on inductive bias for learning 😆

thumb_up_off_alt4

chat_bubble_outline0

repeat2

shareShare