Dwarak Rajagopal (@dwarak) 's Twitter Profile
Dwarak Rajagopal

@dwarak

VP/Head of AI Eng @ Snowflake, ex-{Google, FB/Meta, Uber, Apple, AMD}

ID: 17052951

29-10-2008 21:46:40

220 Tweets

437 Followers

435 Following

Dwarak Rajagopal (@dwarak) 's Twitter Profile Photo

Arctic Ulysses from SnowflakeDB cuts TTFT by 6.8x for long-context LLMs with sequence parallelism. A game-changer for inference! 🚀 Read more: snowflake.com/en/engineering… #AI #LLM #Inference Kudos to Mert, Aurick Qiao, Jeff Rasley, Yuxiong He and Samyam Rajbhandari

Dwarak Rajagopal (@dwarak) 's Twitter Profile Photo

The future of enterprise AI is data-native. With Meta’s Llama 4 models now in Cortex AI, developers can build intelligent, multimodal applications and agents directly on their data—securely, efficiently, and at scale. This is just the beginning. x.com/SnowflakeDB/st…

Dwarak Rajagopal (@dwarak) 's Twitter Profile Photo

Blazing fast inference! 🚀 Aurick Qiao shares how Arctic Inference + vLLM achieves the fastest LLM inference yet—up to 4x speedups. Best part? It's all open-sourced for the community! 💻 #AI #OpenSource #vLLM

Casper Hansen (@casper_hansen_) 's Twitter Profile Photo

Almost a 5x speedup in vLLM🤯 I was able to push a finetuned Mistral Nemo from 110 tokens/s to a peak of 517 tokens/s and acceptance rate of 57.7%. This is with Suffix Decoding from ArcticInference⚡

Dwarak Rajagopal (@dwarak) 's Twitter Profile Photo

Exciting news! The PyTorch Foundation’s expansion with vLLM and DeepSpeed is a game-changer for open-source AI. Can’t wait to see the innovations this brings! As a premier member, Snowflake is excited to join the Board and help grow the OSS community. Big things ahead! 🚀

Dwarak Rajagopal (@dwarak) 's Twitter Profile Photo

🌟 Thrilled to announce Snowflake AI Research’s latest breakthroughs in Text-to-SQL! 🚀 ✅ #1 on the BIRD leaderboard, surpassing SOTA by 2.8% with Arctic-Text2SQL-R1-32B! ✅ #1 on Spider 2.0, mastering real-world challenges with groundbreaking innovation! ❄️ Our team at

Dwarak Rajagopal (@dwarak) 's Twitter Profile Photo

Huge props to Hao Zhang and Hao AI Lab for FastVideo V1! This makes state-of-the-art video generation accessible and fast, revolutionizing how we approach distributed computing in AI. A must-try for every developer!

Dwarak Rajagopal (@dwarak) 's Twitter Profile Photo

🤝 A new era for AI development! Percy Liang's Marin lab is redefining open-source AI with open development—fully transparent and collaborative. Join the movement! #AIInnovation #OpenScience

Dwarak Rajagopal (@dwarak) 's Twitter Profile Photo

Shift Parallelism from Snowflake AI Research is a game-changer! 🚀 3.4x faster LLM inference with Arctic + vLLM. Loving the throughput boost!

Percy Liang (@percyliang) 's Twitter Profile Photo

Wrapped up Stanford CS336 (Language Models from Scratch), taught with an amazing team Tatsunori Hashimoto Marcel Rød Neil Band Rohith Kuditipudi. Researchers are becoming detached from the technical details of how LMs work. In CS336, we try to fix that by having students build everything:

Stas Bekman (@stasbekman) 's Twitter Profile Photo

As I'm diving into Sequence/Context parallelism in the last few days I wanted to share this write up in 2 parts that nicely compares the few approaches out there and some of their combinations with papers: p1: insujang.github.io/2024-01-11/ten… p2: insujang.github.io/2024-09-20/int…

Stas Bekman (@stasbekman) 's Twitter Profile Photo

Today is a deep dive into sequence tiling compute. Sequence tiling massively reduces activation memory footprint and can be applied to computations w/o token inter-dependency. The plot shows a huge memory saving with tiled fused logits loss computation. See section 3.1 in our

Sriram Krishnan (@sriramk) 's Twitter Profile Photo

🇺🇸 Today is a day we have been working towards for six months. We are announcing America’s AI action plan putting us on the road to continued AI dominance.

The three core themes:
- Accelerate AI innovation
- Build American AI infrastructure
- Lead in international AI
Snowflake (@snowflakedb) 's Twitter Profile Photo

We are thrilled to announce that OpenAI’s most advanced model, GPT-5, is now available natively on Snowflake Cortex AI for customers to use. This integration unlocks a wide range of enterprise use cases within Snowflake’s secure, governed environment: ❄️ Transform data into