 
                                LLM360
@llm360
LLM360 is an open research lab enabling community-owned AGI through open-source large model research and development.
ID: 1722322046905143296
https://www.llm360.ai 08-11-2023 18:36:36
110 Tweets
2.2K Followers
54 Following
 
         
        Looking for EleutherAI at #ICLR2025? Come say hi at any of our five posters or the Open Science for Foundation Models workshop where Stella Biderman is giving the opening keynote. 🧵
 
         
         
        KV-caching is great, but will it work for Diffusion Language Models? Zhihan Yang and team showed how to make it work, with a 65x speedup 🚀! Check out the new preprint: arxiv.org/abs/2506.01928 The LLM360 team is very interested in exploring new architectures.
 
        Our team is lucky to have had "early access" to this work through the IFM talk given by Subham Sahoo.
 
                        
                    
                    
                    
                 
         
        Shangshang Wang (@upupwang): 😋 Want strong LLM reasoning without breaking the bank? We explored just how cost-effectively RL can enhance reasoning using LoRA!
        [1/9] Introducing Tina: A family of tiny reasoning models with strong performance at low cost, providing an accessible testbed for RL reasoning. 🧵
        ![Shangshang Wang (@upupwang) on Twitter photo](https://pbs.twimg.com/media/GpO_7AKbwAA_jyf.jpg)
                        