
shreyas
@shreyasnsharma
cs and philosophy @Stanford
ID: 1700262397003309056
08-09-2023 21:38:58
52 Tweet
41 Takipçi
166 Takip Edilen


Happy Throughput Thursday! We’re excited to release Tokasaurus: an LLM inference engine designed from the ground up for high-throughput workloads with large and small models. (Joint work with Ayush Chakravarthy, Ryan Ehrlich, Sabri Eyuboglu, Bradley Brown, Joseph Shetaye,




