Corendos (@thecorendos) 's Twitter Profile
Corendos

@thecorendos

Software Engineer @zml_ai

ID: 1064148302

linkhttp://vent.godeau.xyz calendar_today05-01-2013 22:22:25

307 Tweet

48 Takipçi

82 Takip Edilen

Steeve Morin 🇺🇦 (@steeve) 's Twitter Profile Photo

Hey folks, since we've been asked so many times, here is a quick demo of what we're building. This is a small LLaMA2 sharded on 1 NVIDIA RTX 4090 (in Paris), 1 AMD 6800XT (in Corendos's flat) and 1 Google Cloud TPU v2 over Tailscale. Exact same code, all built on my Mac

Steeve Morin 🇺🇦 (@steeve) 's Twitter Profile Photo

So we're working on a custom async HTTP/1.1 server for the ZML LLM serving stack. This is on a single thread, M3 Max. Uses stackful coroutines and Mitchell Hashimoto's libxev. Not gonna lie, pretty happy about the performance.

So we're working on a custom async HTTP/1.1 server for the <a href="/zml_ai/">ZML</a> LLM serving stack. This is on a single thread, M3 Max.

Uses stackful coroutines and <a href="/mitchellh/">Mitchell Hashimoto</a>'s libxev.

Not gonna lie, pretty happy about the performance.
Corendos (@thecorendos) 's Twitter Profile Photo

> Tweet random de réponse "Lisa a aimé votre réponse" > Autre tweet random de réponse "Brenda a aimé votre réponse" L'aubaine, X Entreprise est à -30% !

Pekka Kallioniemi (@p_kallioniemi) 's Twitter Profile Photo

In today’s Vatnik Soup, I’ll introduce how the Elon’s Election Interference Machine™ (EIM) works. Since Musk & his broligarch allies won the presidency for Trump, they’ve now harnessed the machine to interfere in European elections, including the ones in Germany & Romania. 1/23

In today’s Vatnik Soup, I’ll introduce how the Elon’s Election Interference Machine™ (EIM) works. Since Musk &amp; his broligarch allies won the presidency for Trump, they’ve now harnessed the machine to interfere in European elections, including the ones in Germany &amp; Romania.

1/23
Harry Stebbings (@harrystebbings) 's Twitter Profile Photo

So the show with Jonathan Ross absolutely crushed last week but left me with many more questions. Today is one of our most nerdy but fantastic episodes. - Why Google wins the AI arms race - Why OpenAI is not best positioned - NVIDIA vs. AMD - Inference vs training - The

So the show with <a href="/JonathanRoss321/">Jonathan Ross</a>  absolutely crushed last week but left me with many more questions.

Today is one of our most nerdy but fantastic episodes.

- Why Google wins the AI arms race
- Why OpenAI is not best positioned
- NVIDIA vs. AMD
- Inference vs training
- The
Steeve Morin 🇺🇦 (@steeve) 's Twitter Profile Photo

And after 1 week of work, here is zml/llmd running transparently on TPU with full prefill/decode paged attention. No code change, single flag, as it should be.