kourosh hakhamaneshi (@cyrushakha) 's Twitter Profile
kourosh hakhamaneshi

@cyrushakha

LLMs + Ray @anyscalecompute 💻 prev PhD, EECS, @UCBerkeley 👨‍🎓

ID: 187221383

Joined: 05-09-2010 16:28:55

727 Tweets

922 Followers

471 Following

Rohan Paul (@rohanpaul_ai)

Morgan Stanley Research says OpenAI makes up around $330B of the $880B total future contract value (RPO) tied to Microsoft, Oracle, and CoreWeave, so a lot of supplier growth depends directly on OpenAI’s stability.

That means about 66% of Oracle’s and about 40% of CoreWeave’s
Robert Nishihara (@robertnishihara)

Ray Summit is going to be excellent. Can't wait to hear from xAI, Perplexity, Cursor, Thinking Machines, Physical Intelligence, Applied Intuition, Prime Intellect, vLLM, and so many others.

Some major themes this year:
- Reinforcement learning infra
- Multimodal data (lots of
Seiji Eicher (@seiji_________)

Hi SF folks! I’ll be speaking at this meetup on inference systems this Thursday, 10/23 @ 6PM. It should be a cool event (other speakers from Meta Superintelligence, SGLang, and the Laude Institute). Hope to see you there 🤠 luma.com/brn77bc8?tk=JI…

Anyscale (@anyscalecompute)

Today we’re donating Ray to The Linux Foundation under the PyTorch Foundation, alongside PyTorch + vLLM, strengthening the open compute fabric for AI. This ensures long-term neutrality, open governance, and ecosystem alignment.

Blog: na2.hubs.ly/H01JydX0
Ray Summit (Nov 3–5, SF):
PyTorch (@pytorch)

We’re excited to welcome Ray to the PyTorch Foundation 👋 Ray is an open source distributed computing framework for #AI workloads, including data processing, model training, and inference at scale. By contributing Ray to the PyTorch Foundation, Anyscale
vLLM (@vllm_project)

🚀 Excited to share our work on batch-invariant inference in vLLM!
Now you can get identical results regardless of batch size with just one flag: VLLM_BATCH_INVARIANT=1
No more subtle differences between bs=1 and bs=N (including prefill!). Let's dive into how we built this 🧵👇
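Why batch size can change results at all: floating-point addition is not associative, and different batch sizes can trigger different kernel reduction orders. A GPU-free sketch of the root cause (the flag usage in the comment is taken from the tweet; the rest is illustrative):

```python
# Two accumulation orders of the same three numbers give different
# IEEE-754 doubles — the same effect, inside GPU kernels, is what
# makes bs=1 and bs=N outputs drift apart.
a, b, c = 0.1, 0.2, 0.3
left = (a + b) + c   # one reduction order
right = (c + b) + a  # another reduction order
print(left == right)  # False (0.6000000000000001 vs 0.6)

# Per the tweet, vLLM pins kernels to one reduction order when launched
# with the flag, e.g.:  VLLM_BATCH_INVARIANT=1 vllm serve <model>
```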
Robert Nishihara (@robertnishihara)

I enjoyed speaking at #PyTorchCon today. Wanted to share one slide from my talk about open source AI infra. This is about how Ray and vLLM work together. LLM inference is growing more and more complex, and doing a good job with LLM inference means working across layers and
Simon Mo (@simon_mo_)

Fortunate to be part of two (!) foundation projects (vLLM and Ray) that have great synergy with each other. The Ray + vLLM + PyTorch stack is coming together. Congratulations, Ray!

Robert Nishihara (@robertnishihara)

I'm hiring for a new engineering role working directly with me to support our most sophisticated customers. Looking for someone who wants to work across the AI / AI infra stack, write / debug a ton of code, work directly with customers, and move / learn super fast. DM me.

Anyscale (@anyscalecompute)

Infra that ships. Ideas that scale.
Keynotes at #RaySummit 2025:
 • Chelsea Finn (robot learning, meta-learning)
 • Jimmy Ba (Adam optimizer, DL/RL)
 • Peter Ludwig (safe AI at scale)
Nov 3–5, SF
Save your seat: 🔗na2.hubs.ly/H01MYqw0
uccl_project (@uccl_proj)

🚀 Introducing UCCL-EP: a portable, efficient Expert Parallelism framework that brings DeepEP-level GPU-driven communication with the same APIs to any cloud or hardware — AWS EFA, AMD GPUs, Broadcom NICs, and beyond.
Blog: uccl-project.github.io/posts/uccl-ep/
Code: github.com/uccl-project/u…
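For readers new to Expert Parallelism: the "dispatch" step these libraries accelerate routes each token to the rank that owns its assigned expert. A toy, framework-free illustration (this is NOT the UCCL-EP/DeepEP API; all names and the expert-to-rank layout are illustrative):

```python
from collections import defaultdict

NUM_EXPERTS = 8
NUM_RANKS = 4
EXPERTS_PER_RANK = NUM_EXPERTS // NUM_RANKS  # contiguous layout: rank r owns experts [2r, 2r+1]

def dispatch(token_expert_ids):
    """Bucket token indices by the destination rank that owns each token's expert."""
    buckets = defaultdict(list)
    for tok, eid in enumerate(token_expert_ids):
        buckets[eid // EXPERTS_PER_RANK].append(tok)
    return dict(buckets)

# Tokens 0..5 routed (top-1) to experts [0, 7, 3, 2, 5, 0]:
print(dispatch([0, 7, 3, 2, 5, 0]))
# {0: [0, 5], 3: [1], 1: [2, 3], 2: [4]}
```

Real EP kernels do this routing plus the all-to-all token exchange directly from the GPU, which is where NIC/cloud portability (EFA, AMD, Broadcom) becomes the hard part.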
Ziming Mao (@ziming_mao)

A couple of months ago we were perplexed by slow MoE communication performance on cloud (e.g., EFA with Perplexity kernels). So we built UCCL-EP — an efficient GPU-driven EP library that runs on public clouds (e.g., AWS EFA) and heterogeneous GPUs/NICs, with the same APIs as DeepEP.

vLLM (@vllm_project)

🔥 Ray Summit 2025 will be one of the biggest events for vLLM this year, with 10+ talks centered around vLLM! Looking forward to seeing you there.
🤩 Use our discount code (limited time only!): RAYVLLM50
anyscale.com/ray-summit/2025