Acyr Locatelli (@acyr_l) 's Twitter Profile
Acyr Locatelli

@acyr_l

Lead pre-training @Cohere

ID: 294045100

Joined: 06-05-2011 12:47:58

84 Tweets

563 Followers

870 Following

Cohere Labs (@cohere_labs) 's Twitter Profile Photo

Introducing ✨Aya Expanse ✨ – an open-weights state-of-the-art family of models to help close the language gap with AI. Aya Expanse is both global and local. Driven by a multi-year commitment to multilingual research. cohere.com/research/aya

Max Bartolo (@max_nlp) 's Twitter Profile Photo

Our Command R+ model is one of TIME's 200 Best Inventions of 2024! 🚀 Try it out at coral.cohere.com 🌐 time.com/collection/bes…

Laura Ruis (@lauraruis) 's Twitter Profile Photo

How do LLMs learn to reason from data? Are they ~retrieving the answers from parametric knowledge🦜? In our new preprint, we look at the pretraining data and find evidence against this: Procedural knowledge in pretraining drives LLM reasoning ⚙️🔢 🧵⬇️

Cohere Labs (@cohere_labs) 's Twitter Profile Photo

A moment for @Cohere and Cohere For AI team appreciation. 💙 #NeurIPS2024 - stop by the booth to catch up with our team, or find us throughout the conference.

cohere (@cohere) 's Twitter Profile Photo

Introducing Command R7B: the smallest, fastest, and final model in our R series of enterprise-focused LLMs! It delivers a powerful combination of state-of-the-art performance in its class and efficiency to lower the cost of building AI applications. cohere.com/blog/command-r…

Acyr Locatelli (@acyr_l) 's Twitter Profile Photo

I'm hiring performance engineers for the pre-training team at Cohere. If you enjoy writing efficient kernels and working on hardware-aligned architecture design and optimisation, do reach out! Check out the live job posting here: jobs.ashbyhq.com/cohere/d42f5fd…

AK (@_akhaliq) 's Twitter Profile Photo

Cohere releases Command A on Hugging Face Command A is an open-weights 111B parameter model with a 256k context window focused on delivering great performance across agentic, multilingual, and coding use cases. Command A is on par or better than models like GPT-4o and Deepseek

lmarena.ai (formerly lmsys.org) (@lmarena_ai) 's Twitter Profile Photo

🚀 Big news: cohere's latest Command A now climbs to #13 on Arena!

Another organization joining the top-15 club - congrats to the Cohere team!

Highlights:
- open-weight model (111B)
- 256K context window
- $2.5/$10 input/output MTok

More analysis👇
Cohere Labs (@cohere_labs) 's Twitter Profile Photo

Excited to announce that @Cohere and Cohere Labs models are the first supported inference provider on Hugging Face Hub! 🔥 Looking forward to this new avenue for sharing and serving our models, including the Aya family and Command suite of models.

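For context, a minimal sketch of what calling a Cohere model through the Hugging Face Hub inference client might look like; the provider string and model ID below are illustrative assumptions, not details taken from the announcement.

# Sketch only: assumes huggingface_hub with inference-provider support installed
# and an HF token configured. Provider name and model ID are assumptions.
from huggingface_hub import InferenceClient

client = InferenceClient(provider="cohere")  # assumed provider identifier

response = client.chat_completion(
    model="CohereLabs/c4ai-command-a-03-2025",  # assumed Hub model ID for Command A
    messages=[{"role": "user", "content": "Summarise the Aya Expanse release in one sentence."}],
    max_tokens=128,
)
print(response.choices[0].message.content)
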
Nando de Freitas (@nandodf) 's Twitter Profile Photo

RL is not all you need, nor attention nor Bayesianism nor free energy minimisation, nor an age of first person experience. Such statements are propaganda. You need thousands of people working hard on data pipelines, scaling infrastructure, HPC, apps with feedback to drive

Sara Hooker (@sarahookr) 's Twitter Profile Photo

Very proud of this work, which is being presented at ICLR 2026 later today. While I will not be there, catch up with Viraat Aryabumi and Ahmet Üstün, who are both fantastic and can share more about our work at both Cohere Labs and cohere. 🔥✨
