Meor Amer
@meoramer1
Developer relations @Cohere
ID: 1479004206874849280
06-01-2022 08:18:11
284 Tweet
868 Followers
661 Following
We're ecstatic to bring you "How Transformer LLMs Work" -- a free course with ~90 minutes of video, code, and crisp visuals and animations that explain the modern Transformer architecture, tokenizers, embeddings, and mixture-of-expert models. Maarten Grootendorst and I have developed a
I'm excited to the tech report for our @Cohere Cohere For AI Command A and Command R7B models. We highlight our novel approach to model training including the use of self-refinement algorithms and model merging techniques at scale. Command A is an efficient, agent-optimised