Tri Dao (@tri_dao) 's Twitter Profile
Tri Dao

@tri_dao

Asst. Prof @PrincetonCS, Chief Scientist @togethercompute. Machine learning & systems.

ID: 568879807

linkhttps://tridao.me calendar_today02-05-2012 07:13:50

795 Tweet

28,28K Takipçi

602 Takip Edilen

Tri Dao (@tri_dao) 's Twitter Profile Photo

Getting mem-bound kernels to speed-of-light isn't a dark art, it's just about getting the a couple of details right. We wrote a tutorial on how to do this, with code you can directly use. Thanks to the new CuTe-DSL, we can hit speed-of-light without a single line of CUDA C++.