
Yiying Zhang
@yiying__zhang
Founder and CEO of GenseeAI, Associate Professor of Computer Science at UCSD. LLM serving, AI Workflows, Agents
ID: 936289743511441408
30-11-2017 17:44:05
31 Tweets
1.1K Followers
139 Following







Today, LLMs are constantly being augmented with tools, agents, models, RAG, etc. We built InferCept [ICML'24], the first serving framework designed for augmented LLMs. InferCept sustains a 1.6x-2x higher serving load than SOTA LLM serving systems. #AugLLM mlsys.wuklab.io/posts/infercep…








