Avinash Sooriyarachchi
@avitwit3
Building AI Systems @mistralai
ID: 1377740842455216128
https://github.com/avisoori1x 01-04-2021 21:53:15
28 Tweet
111 Followers
116 Following
I wanted to extend the simple from-scratch MoE LM implementation I wrote with expert capacity. Given Grok-1 is open source, hope this helps understand MoEs a bit better. Again the base for this is makemore/ nanoGPT fromAndrej Karpathy huggingface.co/blog/AviSoori1…
I’ve seen a lot of interest from developers to reduce cost and deploy LLMs on device. With these new models from Mistral AI and our QAT stack, on device deployments without degradation is a reality. Amazing work Pierre Stock Sandeep Subramanian Teven Le Scao and team!!
I learned yesterday the video I made in 2017 explaining how Bitcoin works was taken down, and my channel received a copyright strike (despite it being 100% my own content). The request seems to have been issued by a company chainpatrol, on behalf of Arbitrum, whose website says
Proud to share the first public model I worked on at Mistral AI. A decoder-only LLM optimized for creative writing, narrative generation, roleplay, and character-driven dialogue. Now live via API as labs-mistral-small-creative docs.mistral.ai/models/mistral…