Robert Nishihara (@robertnishihara) 's Twitter Profile
Robert Nishihara

@robertnishihara

Co-founder @anyscalecompute. Co-creator of @raydistributed. Previously PhD ML at Berkeley.

ID: 25191683

Link: http://www.robertnishihara.com · Joined: 19-03-2009 00:04:47

1.1K Tweets

7.7K Followers

718 Following

Robert Nishihara (@robertnishihara) 's Twitter Profile Photo

One of our biggest advantages is talent. We get top people from all over the world who largely stay here and contribute to innovation here (especially AI). Turning away these people is a mistake.

Anyscale (@anyscalecompute) 's Twitter Profile Photo

1/5 How do you deliver fast, personalized search to 100M+ users? You need more than a good model. You need great infrastructure. Here’s how Notion scaled their search pipeline using Ray + Anyscale ↓🧵
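
A rough sketch (not Notion's actual code) of what one stage of such a pipeline can look like with Ray Data; the model name, S3 paths, and column names here are assumptions for illustration:

import ray
from sentence_transformers import SentenceTransformer

class EmbedBatch:
    """One embedding-model replica per Ray actor; batches arrive as dicts of columns."""
    def __init__(self):
        self.model = SentenceTransformer("all-MiniLM-L6-v2")  # hypothetical embedding model

    def __call__(self, batch):
        batch["embedding"] = self.model.encode(list(batch["text"]))
        return batch

# Read documents, embed them on a pool of GPU workers, write the results back out.
ds = ray.data.read_parquet("s3://example-bucket/docs/")             # hypothetical input path
ds = ds.map_batches(EmbedBatch, concurrency=4, num_gpus=1, batch_size=256)
ds.write_parquet("s3://example-bucket/doc-embeddings/")             # hypothetical output path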

Robert Nishihara (@robertnishihara) 's Twitter Profile Photo

This is an incredibly technical talk from Nubank on building foundation models for financial transactions. The team that did this joined Nubank via an acquisition. They proceeded to leverage all of Nubank's existing data & model infrastructure and surgically insert foundation

Lukas Biewald (@l2k) 's Twitter Profile Photo

My friend Robert Nishihara told me a fun math problem the other day. O3 gets it wrong even after a fair amount of prodding - is this somehow harder than the olympiad problems where it gets 95% accuracy?

Robert Nishihara (@robertnishihara) 's Twitter Profile Photo

Ray Summit talk submissions are now open! We're very, very excited to hear about your work.
- How you use Ray
- How you use vLLM
- AI infrastructure
- Multimodal data
- Post-training
- Agentic systems
- Challenges at scale

Robert Nishihara (@robertnishihara) 's Twitter Profile Photo

I've already heard two companies this week say "we built out everything around text data, then we began introducing images / video and everything broke."

Robert Nishihara (@robertnishihara) 's Twitter Profile Photo

The AI compute software stack consists of 3 specialized layers:

🔧🔧🔧 Layer 1: Training & Inference Framework (PyTorch + vLLM)
• Runs models efficiently on GPUs
• Handles model optimization and model parallelism strategies
• Manages accelerator memory and automatic
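
As a hedged illustration of that Layer-1 role, here is a minimal vLLM snippet; the model name is an assumption, and tensor_parallel_size is one example of a model-parallelism knob:

from vllm import LLM, SamplingParams

# Load the model once; tensor_parallel_size splits the weights across 2 GPUs.
llm = LLM(model="meta-llama/Llama-3.1-8B-Instruct", tensor_parallel_size=2)

params = SamplingParams(temperature=0.7, max_tokens=128)
outputs = llm.generate(["Summarize the AI compute stack in one sentence."], params)
print(outputs[0].outputs[0].text)
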
kourosh hakhamaneshi (@cyrushakha) 's Twitter Profile Photo

I get a lot of questions about the role of each of these layers of the AI compute stack: vLLM, Ray, k8s, etc. What does Ray do in vLLM, and what does Ray do around vLLM? Why is Ray Core part of post-training frameworks like vERL? In this blog Robert Nishihara depicts what a
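
One hedged sketch of "Ray around vLLM": Ray Serve placing and scaling vLLM engine replicas across a cluster. The model name and replica count below are illustrative assumptions, not a prescribed setup:

from ray import serve
from vllm import LLM, SamplingParams

@serve.deployment(num_replicas=2, ray_actor_options={"num_gpus": 1})
class Generator:
    def __init__(self):
        # Each Serve replica owns one vLLM engine pinned to one GPU.
        self.llm = LLM(model="Qwen/Qwen2.5-7B-Instruct")  # hypothetical model
        self.params = SamplingParams(max_tokens=64)

    async def __call__(self, request):
        prompt = (await request.json())["prompt"]
        out = self.llm.generate([prompt], self.params)
        return {"text": out[0].outputs[0].text}

serve.run(Generator.bind())  # Ray schedules the replicas onto GPU nodes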

Robert Nishihara (@robertnishihara) 's Twitter Profile Photo

This table was a footnote at the end of the blog, but it's actually one of the most interesting points. There is an emerging stack for post-training. anyscale.com/blog/ai-comput…

Robert Nishihara (@robertnishihara) 's Twitter Profile Photo

Beyond pre-training, here's how I imagine most learning will work.

1. AI models / systems will maintain large collections of retrievable knowledge. This will include facts like "the capital of California is Sacramento" and tactics like "when playing Monopoly, buy a bunch of
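
A toy, hedged sketch of that "retrievable knowledge" idea: store facts and tactics as embeddings and look them up by similarity at inference time. The model name and the stored entries are illustrative assumptions:

import numpy as np
from sentence_transformers import SentenceTransformer

model = SentenceTransformer("all-MiniLM-L6-v2")  # hypothetical embedding model

# A small mix of facts and tactics, stored alongside their embeddings.
knowledge = [
    "The capital of California is Sacramento.",
    "When playing Monopoly, buy properties aggressively early in the game.",
]
vectors = model.encode(knowledge, normalize_embeddings=True)

def retrieve(query: str, k: int = 1):
    """Return the k stored entries most similar to the query."""
    q = model.encode([query], normalize_embeddings=True)[0]
    scores = vectors @ q
    return [knowledge[i] for i in np.argsort(-scores)[:k]]

print(retrieve("What's a good opening strategy in Monopoly?"))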

Ivan Nardini (@ivnardini) 's Twitter Profile Photo

I really enjoyed the new blog from Anyscale about the open-source stack for AI compute. Robert shared a great collection of examples showing how companies such as Pinterest, Uber, and Roblox integrate Kubernetes, Ray, PyTorch, and vLLM. This stack enables extensive training

Robert Nishihara (@robertnishihara) 's Twitter Profile Photo

Impressive work! Agentic workflows have tons and tons of design and architectural decisions that affect performance and quality (choices around models, embeddings, how to tokenize / chunk data, how to do retrieval, how to construct context, etc.). There's a massive search space,
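
A hedged sketch of what that search space can look like once it is written down as a config grid; every choice below is an illustrative assumption, not a recommendation:

from itertools import product

# Each axis is one design decision in an agentic / RAG workflow.
search_space = {
    "llm":            ["gpt-4o-mini", "llama-3.1-8b-instruct"],
    "embedding":      ["all-MiniLM-L6-v2", "text-embedding-3-small"],
    "chunk_size":     [256, 512, 1024],
    "retrieval":      ["dense", "hybrid-bm25"],
    "context_chunks": [4, 8, 16],   # number of retrieved chunks placed in context
}

configs = [dict(zip(search_space, values)) for values in product(*search_space.values())]
print(f"{len(configs)} candidate pipeline configurations to evaluate")  # 2*2*3*2*3 = 72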