Anthonix (@zealandic1)'s Twitter Profile
Anthonix

@zealandic1

// Previously Neural Engine Architect @Apple // Reformed High Frequency Trader // CS PhD

ID: 1380287387742298113

Joined: 08-04-2021 22:32:19

174 Tweets

343 Followers

134 Following

Anthonix (@zealandic1):

🔥 PyTorch 2.4.0 on AMD is a *huge* improvement! E.g., for the same setup as the plot below, perf is now at ~245,000 toks/sec!
- compile() works
- flash attention works
- perf tuning works
It is now faster than the hipified llm.c! 🎉 Thank you AMD, PyTorch & Soumith Chintala!
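
Not the author's actual benchmark, but a minimal sketch of the two features called out above working together on ROCm as of PyTorch 2.4: torch.compile() and the flash-attention backend of scaled_dot_product_attention. The toy module, tensor sizes, and dtype below are illustrative assumptions, not the setup behind the quoted numbers.

```python
import torch
import torch.nn.functional as F

# Toy attention block, only to exercise the two features named in the tweet:
# torch.compile() and the flash-attention SDPA path, both usable on AMD GPUs.
class TinyAttention(torch.nn.Module):
    def __init__(self, dim=512, heads=8):
        super().__init__()
        self.heads = heads
        self.qkv = torch.nn.Linear(dim, 3 * dim)
        self.out = torch.nn.Linear(dim, dim)

    def forward(self, x):
        b, t, d = x.shape
        q, k, v = self.qkv(x).chunk(3, dim=-1)
        # reshape to (batch, heads, seq, head_dim)
        q, k, v = (z.view(b, t, self.heads, d // self.heads).transpose(1, 2)
                   for z in (q, k, v))
        # dispatches to a fused/flash attention kernel where supported
        y = F.scaled_dot_product_attention(q, k, v, is_causal=True)
        return self.out(y.transpose(1, 2).reshape(b, t, d))

model = TinyAttention().cuda().to(torch.bfloat16)   # "cuda" device also covers ROCm/HIP builds
model = torch.compile(model)                        # compile() now works on AMD GPUs

x = torch.randn(8, 1024, 512, device="cuda", dtype=torch.bfloat16)
y = model(x)
```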

Anthonix (@zealandic1):

Wow, I just assumed that Zuck had created his own masterpiece outta chicken wire and bondo... turns out it is the work of a real artist

Anthonix (@zealandic1):

Finally got around to trying out llm.c on MI300x... the code I had tuned on MI250x gets decent perf straight outta the gate.

But wtf is going on with PyTorch perf on MI300x? Tried 2.4 & nightly, with ROCm 6.1 & 6.2... using full autotuning, flash attention, etc. All are so slow.
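
For context, a rough, self-contained sketch of the kind of "full autotuning" timing run alluded to above: compile a stand-in model with Inductor's max-autotune mode and measure forward-pass throughput. The model, shapes, and iteration counts are placeholders, not the llm.c comparison setup.

```python
import time
import torch

# Stand-in model just to exercise max-autotune compilation and a timing loop.
model = torch.nn.Sequential(
    torch.nn.Linear(1024, 4096),
    torch.nn.GELU(),
    torch.nn.Linear(4096, 1024),
).cuda().to(torch.bfloat16)

compiled = torch.compile(model, mode="max-autotune")  # full Inductor autotuning

x = torch.randn(64, 1024, device="cuda", dtype=torch.bfloat16)

# warm-up triggers compilation and kernel autotuning
for _ in range(3):
    compiled(x)
torch.cuda.synchronize()

t0 = time.time()
for _ in range(50):
    compiled(x)
torch.cuda.synchronize()
print(f"{50 * x.shape[0] / (time.time() - t0):.0f} samples/sec")
```
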
Anthonix (@zealandic1):

Awesome! Also for local training, this would enable training across a bunch of machines without expensive interconnect :)

Anthonix (@zealandic1):

Hitting ~3.2M toks/sec on MI300x for tiny llama3 training. Would love to see @pytorch training on MI300x get some massive improvements so I don't have to write my own kernels!
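
For reference, a headline tokens-per-second figure like the one above is ordinarily just tokens processed divided by wall-clock time. A tiny sketch of that bookkeeping follows; the batch size, sequence length, step count, and elapsed time are made-up values for illustration, not the actual run's configuration.

```python
# Tokens/sec as usually reported: tokens processed divided by wall-clock time.
# All numbers below are hypothetical, chosen only to show the arithmetic.
batch_size = 32          # sequences per step
seq_len = 2048           # tokens per sequence
steps = 1000             # optimizer steps timed
elapsed_sec = 21.0       # measured wall-clock time for those steps (hypothetical)

tokens = batch_size * seq_len * steps
print(f"{tokens / elapsed_sec:,.0f} toks/sec")   # ~3.1M toks/sec for these numbers
```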