Tom Sherborne (@tomsherborne)'s Twitter Profile
Tom Sherborne

@tomsherborne

code MTS @cohere ex: @edinburghnlp @allen_ai @cambridgenlp @ucl @apple.

ID: 971827388

Link: https://tomsherborne.github.io
Joined: 26-11-2012 11:56:04

336 Tweets

938 Followers

279 Following

Cohere Labs (@cohere_labs)'s Twitter Profile Photo

In our latest work, we ask “Can model merging help with task tradeoffs over models obtained from different training runs”? We extend model merging to a setup where you have many *generalist* LLM checkpoints showing performance tradeoffs.

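For context on the technique itself, here is a minimal sketch of weight-space model merging (simple parameter averaging across checkpoints) in PyTorch. The function and the averaging scheme are illustrative assumptions; the tweet does not specify which merging method the work actually uses.

```python
# Minimal sketch of weight-space model merging: average the parameters of
# several checkpoints that share one architecture. Illustrative only; not
# the specific merging method used in the Cohere Labs work.
import torch

def merge_checkpoints(state_dicts, weights=None):
    """Return a state dict whose tensors are the (weighted) mean of the inputs."""
    if weights is None:
        weights = [1.0 / len(state_dicts)] * len(state_dicts)
    merged = {}
    for key in state_dicts[0]:
        merged[key] = sum(w * sd[key].float() for w, sd in zip(weights, state_dicts))
    return merged

# Usage (hypothetical checkpoint paths):
# sds = [torch.load(p, map_location="cpu") for p in ["ckpt_a.pt", "ckpt_b.pt"]]
# model.load_state_dict(merge_checkpoints(sds))
```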
cohere (@cohere)'s Twitter Profile Photo

Introducing Command R7B: the smallest, fastest, and final model in our R series of enterprise-focused LLMs! It delivers a powerful combination of state-of-the-art performance in its class and efficiency to lower the cost of building AI applications. cohere.com/blog/command-r…

Daniel San (@dani_avila7)'s Twitter Profile Photo

Trying out Command R7B in VSCode, and the model performs brilliantly! 👏 The latest model from the Cohere Command family shows excellent performance working with code inside VSCode, using CodeGPT to integrate the model. Congrats to the Cohere team! 🥳 If you want to use Cohere's
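As a rough illustration of trying a Command model on a coding prompt programmatically (outside the VSCode/CodeGPT setup above), here is a sketch using Cohere's Python SDK; the model ID string is an assumption, so check Cohere's documentation for the exact identifier.

```python
# Sketch: sending a coding prompt to a Command model via Cohere's Python SDK (v2 client).
# The model ID "command-r7b-12-2024" is an assumption; confirm the exact name in Cohere's docs.
import cohere

co = cohere.ClientV2(api_key="YOUR_API_KEY")

response = co.chat(
    model="command-r7b-12-2024",
    messages=[{"role": "user", "content": "Write a Python function that reverses a linked list."}],
)
print(response.message.content[0].text)
```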

Tom Sherborne (@tomsherborne)'s Twitter Profile Photo

We are hiring at cohere for an Agent Infrastructure Engineer! If you want to work on building the next generation of agent models for #RAG, #ToolUse, #Code, #Reasoning and more, then apply here. DM me if you have any Qs. jobs.ashbyhq.com/cohere/3f797fe…

Command A(idan) (@aidangomez)'s Twitter Profile Photo

Today cohere is very excited to introduce Command A, our new model succeeding Command R+. Command A is an open-weights 111B parameter model with a 256k context window focused on delivering great performance across agentic, multilingual, and coding use cases. 🧵

Nick Frosst (@nickfrosst)'s Twitter Profile Photo

Today we are releasing Command A - Cohere’s newest model that offers enterprises powerful AI with minimum hardware :) It beats out bigger, slower models on enterprise agentic task performance, and can run on just two GPUs. Learn more about it: cohere.com/blog/command-a/

Max Bartolo (@max_nlp)'s Twitter Profile Photo

I'm excited to share the tech report for our @Cohere Cohere For AI Command A and Command R7B models. We highlight our novel approach to model training, including the use of self-refinement algorithms and model merging techniques at scale. Command A is an efficient, agent-optimised

Seraphina Goldfarb-Tarrant (@seraphinagt)'s Twitter Profile Photo

Today (two weeks after model launch 🔥) we're releasing a technical report of how we made Command A and R7B 🚀! It has detailed breakdowns of our training process, and evaluations per capability (tools, multilingual, code, reasoning, safety, enterprise, long context)🧵 1/3.

cohere (@cohere)'s Twitter Profile Photo

We’re redefining what’s possible with AI. With the release of our latest model, Command A, optimized for real-world agentic and multilingual tasks, we’re demonstrating our commitment to bringing enterprises AI that goes beyond the ordinary, and offers security & efficiency.

Yannis Flet-Berliac (@yfletberliac)'s Twitter Profile Photo

Excited to finally share that CoPG — the RL method I co-authored with Nathan Grinsztajn and amazing colleagues — was used throughout the post-training (offline & online learning) of cohere’s new Command models! 🖊️ Tech report: cohere.com/research/paper… 🤖 CoPG: arxiv.org/abs/2406.19185

Alex Gurung (@alexaag1234)'s Twitter Profile Photo

Preprint: Can we learn to reason for story generation (~100k tokens), without reward models? Yes! We introduce an RLVR-inspired reward paradigm VR-CLI that correlates with human judgements of quality on the 'novel' task of Next-Chapter Prediction. Paper: arxiv.org/abs/2503.22828

cohere (@cohere)'s Twitter Profile Photo

Command A, our state-of-the-art generative model, is now the highest-scoring generalist LLM on the Bird Bench leaderboard for SQL! It outperforms other systems that rely on extensive scaffolding to tackle these SQL benchmarks, and instead delivers these results out-of-the-box,

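To make the out-of-the-box claim concrete, here is a sketch of a single-prompt text-to-SQL call with no scaffolding, using Cohere's Python SDK; the schema, question, and model ID are illustrative assumptions.

```python
# Sketch: out-of-the-box text-to-SQL with a single prompt (no scaffolding, retries, or tools).
# The model ID "command-a-03-2025" and the schema/question are illustrative assumptions.
import cohere

co = cohere.ClientV2(api_key="YOUR_API_KEY")

schema = "CREATE TABLE orders (id INTEGER, customer_id INTEGER, total REAL, created_at DATE);"
question = "What was the total revenue per customer in 2024?"

response = co.chat(
    model="command-a-03-2025",
    messages=[{
        "role": "user",
        "content": f"Schema:\n{schema}\n\nWrite a single SQL query to answer: {question}",
    }],
)
print(response.message.content[0].text)
```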