 
                                Lequn Chen
@abcdabcd987
Faster and cheaper LLM inference.
ID: 470747166
https://abcdabcd987.com 22-01-2012 03:26:52
25 Tweet
893 Takipçi
559 Takip Edilen
 
         
        Really good observation from Tianle Cai and Junru Shao . I did a quick sanity check. Delta between Mixtral 8x7B MoE and Mistral 7B is NOT low-rank. SGMV is not applicable here. We need new research :)
 
                        
                    
                    
                    
                 
         
         
         
        Go Lequn Chen (Lequn Chen)! Great work on making lots LoRAs cheap to serve. Nice collaboration with Zihao Ye Arvind Krishnamurthy and others! #mlsys24 arxiv.org/abs/2310.18547
 
                        
                    
                    
                    
                 
         
         
         
         
         
         
         
         
         
         
         
                         
                         
                        