Amos You
@amooseyou
cs @ucberkeley
ID: 1678601958670893059
http://amosyou.com 11-07-2023 03:08:24
37 Tweet
86 Followers
199 Following
Mistral Large 3 debuts as the #1 open source coding model on the lmarena.ai leaderboard. We'd love for you to try it! More on coding in a few days... π
Congrats to the Mistral AI team on the launch of Devstral 2! π vLLM now delivers Day-0 support for the Devstral 2 Instruct models β optimized for agentic coding, deep codebase exploration, and multi-file editing at scale. Feel free to reach out π
Congrats to Mistral AI on the big launch π You can check out Mistral Vibe in Zed today β just download it and add your API key!
Proud to share the first public model I worked on at Mistral AI. A decoder-only LLM optimized for creative writing, narrative generation, roleplay, and character-driven dialogue. Now live via API as labs-mistral-small-creative docs.mistral.ai/models/mistralβ¦
new Ministral 3 tech report from Mistral AI, they train competitive small models on 1/3T tokens only. the secret? pruning + distillation distillation: > in pre-training they use Mistral Small 3.1 Instruct as a teacher for ALL variants (so not really cascade distillation, each
Incredible debugging story from Mistral AI: a memory leak in vLLM that standard tools couldn't detect. The hunt went from Python profilers β pmap β BPFtrace β GDB, finally tracing it to UCX's mmap hooks. Fix merged: github.com/vllm-project/vβ¦ Thanks for sharing! A must-read
Introducing Voxtral Transcribe 2, next-gen speech-to-text models by Mistral AI. State-of-the-art transcription, speaker diarization, sub-200ms real-time latency. Details in π§΅
Congrats to Mistral AI on releasing Voxtral Mini 4B Realtime! π Day-0 support in vLLM! A 4B streaming ASR model achieving <500ms latency while matching offline model accuracy, supporting 13 languages. vLLM's new Realtime API `/v1/realtime` provides audio streaming - optimized
Introducing Mistral AI's biggest hackathon ever! π Feb 28 - Mar 1 π Paris | London | NY | SF | Tokyo | Singapore | Sydney & online 48 hours. The best hackers. π€ Partners: Weights & Biases NVIDIA Amazon Web Services Iterate π $200K in prizes. Special awards from ElevenLabs Hugging Face