Amos You
@amooseyou
cs @ucberkeley
ID: 1678601958670893059
http://amosyou.com 11-07-2023 03:08:24
37 Tweet
86 Takipรงi
199 Takip Edilen
Mistral Large 3 debuts as the #1 open source coding model on the lmarena.ai leaderboard. We'd love for you to try it! More on coding in a few days... ๐
Congrats to the Mistral AI team on the launch of Devstral 2! ๐ vLLM now delivers Day-0 support for the Devstral 2 Instruct models โ optimized for agentic coding, deep codebase exploration, and multi-file editing at scale. Feel free to reach out ๐
Congrats to Mistral AI on the big launch ๐ You can check out Mistral Vibe in Zed today โ just download it and add your API key!
Proud to share the first public model I worked on at Mistral AI. A decoder-only LLM optimized for creative writing, narrative generation, roleplay, and character-driven dialogue. Now live via API as labs-mistral-small-creative docs.mistral.ai/models/mistralโฆ
new Ministral 3 tech report from Mistral AI, they train competitive small models on 1/3T tokens only. the secret? pruning + distillation distillation: > in pre-training they use Mistral Small 3.1 Instruct as a teacher for ALL variants (so not really cascade distillation, each
Incredible debugging story from Mistral AI: a memory leak in vLLM that standard tools couldn't detect. The hunt went from Python profilers โ pmap โ BPFtrace โ GDB, finally tracing it to UCX's mmap hooks. Fix merged: github.com/vllm-project/vโฆ Thanks for sharing! A must-read
Introducing Voxtral Transcribe 2, next-gen speech-to-text models by Mistral AI. State-of-the-art transcription, speaker diarization, sub-200ms real-time latency. Details in ๐งต
Congrats to Mistral AI on releasing Voxtral Mini 4B Realtime! ๐ Day-0 support in vLLM! A 4B streaming ASR model achieving <500ms latency while matching offline model accuracy, supporting 13 languages. vLLM's new Realtime API `/v1/realtime` provides audio streaming - optimized
Introducing Mistral AI's biggest hackathon ever! ๐ Feb 28 - Mar 1 ๐ Paris | London | NY | SF | Tokyo | Singapore | Sydney & online 48 hours. The best hackers. ๐ค Partners: Weights & Biases NVIDIA Amazon Web Services Iterate ๐ $200K in prizes. Special awards from ElevenLabs Hugging Face