Meor Amer (@meoramer1) Twitter Tweets • TwiCopy

a year ago

I don’t care about ASI. I just want to do less work.

thumb_up_off_alt159

chat_bubble_outline14

repeat17

shareShare

Jay Alammar

a year ago

The Illustrated DeepSeek-R1 Spent the weekend reading the paper and sorting through the intuitions. Here's a visual guide and the main intuitions to understand the model and the process that created it. Link in the first reply. All feedback welcome.

thumb_up_off_alt1,1K

chat_bubble_outline24

repeat225

shareShare

Jay Alammar

a year ago

We're ecstatic to bring you "How Transformer LLMs Work" -- a free course with ~90 minutes of video, code, and crisp visuals and animations that explain the modern Transformer architecture, tokenizers, embeddings, and mixture-of-expert models. Maarten Grootendorst and I have developed a

thumb_up_off_alt1,1K

chat_bubble_outline28

repeat217

shareShare

Meor Amer

a year ago

Aya Vision is here!

thumb_up_off_alt13

chat_bubble_outline0

repeat2

shareShare

cohere

@cohere

a year ago

We’re excited to introduce our newest state-of-the-art model: Command A! Command A provides enterprises maximum performance across agentic tasks with minimal compute requirements.

thumb_up_off_alt1,1K

chat_bubble_outline28

repeat200

shareShare

Meor Amer

a year ago

Introducing our new model - Command A! On par/better than GPT-4o & DeepSeek-V3 in agentic enterprise tasks. Much more efficient (just 2 GPUs) and much faster (156 tokens/sec). Read all about it here: cohere.com/blog/command-a

thumb_up_off_alt31

chat_bubble_outline0

repeat6

shareShare

Nick Frosst

a year ago

I added cohere command A to this chart, I had to extend the axis a bit though….

I added <a href="/cohere/">cohere</a> command A to this chart, I had to extend the axis a bit though….

thumb_up_off_alt693

chat_bubble_outline33

repeat47

shareShare

Nick Frosst

a year ago

UPDATE: my numbers were off, external benchmarking actually shows we are faster and better. GPQA-diamond: 53% miliseconds per token: 5.36 artificialanalysis.ai/providers/cohe…

thumb_up_off_alt151

chat_bubble_outline10

repeat12

shareShare

Max Bartolo

@max_nlp

a year ago

I'm excited to the tech report for our @Cohere Cohere For AI Command A and Command R7B models. We highlight our novel approach to model training including the use of self-refinement algorithms and model merging techniques at scale. Command A is an efficient, agent-optimised

I'm excited to the tech report for our @Cohere <a href="/CohereForAI/">Cohere For AI</a> Command A and Command R7B models. We highlight our novel approach to model training including the use of self-refinement algorithms and model merging techniques at scale. Command A is an efficient, agent-optimised

thumb_up_off_alt278

chat_bubble_outline9

repeat76

shareShare

Jay Alammar

a year ago

And for the first time, the Command A paper IS ITSELF The Illustrated Command A 🥲

thumb_up_off_alt27

chat_bubble_outline1

repeat4

shareShare

Meor Amer

9 months ago

Command vision is here!

thumb_up_off_alt16

chat_bubble_outline0

repeat2

shareShare

Meor Amer

9 months ago

Say hello to North - the secure and customizable AI agent platform

thumb_up_off_alt23

chat_bubble_outline0

repeat2

shareShare

Nick Frosst

9 months ago

AI does the boring work, you do the creative work.

thumb_up_off_alt134

chat_bubble_outline6

repeat8

shareShare

Jay Alammar

9 months ago

As one of the earliest builders of LLMs, Cohere realized early that enterprises need more than a model -- they need: 1) a secure solution (with private deployment) 2) that connects to their data (Salesforce, email, Slack, or internally defined with MCP) 3) Is powered by an LLM

thumb_up_off_alt90

chat_bubble_outline1

repeat9

shareShare

Nick Frosst