 
                                Yiying Zhang
@yiying__zhang
Founder and CEO of GenseeAI, Associate Professor of Computer Science at UCSD. LLM serving, AI Workflows, Agents
ID: 936289743511441408
30-11-2017 17:44:05
31 Tweet
1,1K Followers
139 Following
 
         
         
         
         
         
         
        Today, LLMs are constantly being augmented with tools, agents, models, RAG, etc. We built InferCept [ICML'24], the first serving framework designed for augmented LLMs. InferCept sustains a 1.6x-2x higher serving load than SOTA LLM serving systems. #AugLLM mlsys.wuklab.io/posts/infercep…
 
         
         
         
         
         
         
         
         
         
        ![Yizhou Shan (@yizhou_shan) on Twitter photo Clio is an hardware-based (FPGA) memory disaggregation solution with a new virtual memory system, a customized transport, and a framework for computation offloading. [2/2] Clio is an hardware-based (FPGA) memory disaggregation solution with a new virtual memory system, a customized transport, and a framework for computation offloading. [2/2]](https://pbs.twimg.com/media/FEVkY7uVcAAZyMf.jpg) 
                         
                         
                        