mike64_t (@mike64_t) 's Twitter Profile
mike64_t

@mike64_t

descending the gradient

ID: 1581044963114209286

calendar_today14-10-2022 22:11:31

2,2K Tweet

2,2K Takipçi

282 Takip Edilen

mike64_t (@mike64_t) 's Twitter Profile Photo

wellp and there goes the austria doesn’t have school shootings streak. lots of things we can learn from the US, but this ain’t one of them

Sebastian Aaltonen (@sebaaltonen) 's Twitter Profile Photo

It's easy to blame graphics as it's visible, but that's not the reason for modern software slowness. The reason is CPU cache misses, mutex waits and stupid slow code everywhere that nobody knows exists, because they never run a debugger or profiler.

tenderizzation (@tenderizzation) 's Twitter Profile Photo

primeintellect sending a tensor through PCI-E to host memory, through ethernet/TCP/IP across a continent to another node's host memory, and back down through PCI-E again

TBPN (@tbpn) 's Twitter Profile Photo

"You can break AI down into 5 tiers." - George Hotz 🌑 "Data centers - tier 1, fabs - tier 2, Nvidia/AMD - tier 3, OpenAI/Anthropic - tier 4, and completely worthless things like Cursor and Windsurf, which are tier 5." "OpenAI and Anthropic will eat all the value from the

Yann LeCun (@ylecun) 's Twitter Profile Photo

It is intuitively obvious that reasoning in continuous embedding space is dramatically more powerful than reasoning in discrete token space. This paper from Yuandong Tian and team show that it is the case theoretically.