Jerome Good
@clvswft03
Good Day
ID: 61996192
01-08-2009 08:39:16
71 Tweet
10 Followers
334 Following
mistral.rs is a Rust-based inference engine that offers blazing fast serving for local LLMs ⚡️ Built on top of candle by Hugging Face, it comes with a slew of features and supports all the latest models: 🔥 Features: Flash attention v2, prefix caching, 2-8bit