Dwarak Rajagopal
@dwarak
VP/Head of AI Eng @ Snowflake, ex-{Google, FB/Meta, Uber, Apple, AMD}
ID: 17052951
29-10-2008 21:46:40
220 Tweet
437 Followers
435 Following
Arctic Ulysses from SnowflakeDB cuts TTFT by 6.8x for long-context LLMs with sequence parallelism. A game-changer for inference! 🚀 Read more: snowflake.com/en/engineering… #AI #LLM #Inference Kudos to Mert, Aurick Qiao, Jeff Rasley, Yuxiong He and Samyam Rajbhandari
Blazing fast inference! 🚀 Aurick Qiao shares how Arctic Inference + vLLM achieves the fastest LLM inference yet—up to 4x speedups. Best part? It's all open-sourced for the community! 💻 #AI #OpenSource #vLLM

🤝 A new era for AI development! Percy Liang's Marin lab is redefining open-source AI with open development—fully transparent and collaborative. Join the movement! #AIInnovation #OpenScience


Wrapped up Stanford CS336 (Language Models from Scratch), taught with an amazing team Tatsunori Hashimoto Marcel Rød Neil Band Rohith Kuditipudi. Researchers are becoming detached from the technical details of how LMs work. In CS336, we try to fix that by having students build everything:


Arctic Inference helps All Hands AI complete real-world coding tasks 2x faster through faster LLM inference. Check it out!


