Pradeep Ramani (@_prrama) 's Twitter Profile
Pradeep Ramani

@_prrama

15 Trillion Human Cells + 100 Trillion Bacterial cells + 1 consciousness.
Opinions are my own.

Sr. Architect @NVIDIA | CUTLASS | CUDA | GPGPU

ID: 470874233

Link: https://www.linkedin.com/in/pradeep-ramani/
Joined: 22-01-2012 07:34:45

91 Tweets

209 Followers

153 Following

Soumith Chintala (@soumithchintala) 's Twitter Profile Photo

I have taken off today and yesterday at work, because I am not able to focus. I can't imagine having a paper deadline right now, and I can't imagine the personal stress my black friends are in. NeurIPS Conference consider extending the deadline, even if it's selectively done.

Pradeep Ramani (@_prrama) 's Twitter Profile Photo

Trying to book evacuation flights via Air India is probably the worst experience one can ever have dealing with any business! If you are incapable of providing ANY level of service, don't do it! Zero leadership, Zero Service, Zero transparency! #AirIndiaSucks

Pradeep Ramani (@_prrama) 's Twitter Profile Photo

People are already so stressed out, stranded in the US with no visa and no medical insurance, and booking evacuation flights via @airindiain is a nightmare! No clarity, horrible customer service, dead website links and phone numbers! FIX IT! PMO India @airindiain #AllowPvt

Sundar Pichai (@sundarpichai) 's Twitter Profile Photo

Immigration has contributed immensely to America’s economic success, making it a global leader in tech, and also Google the company it is today. Disappointed by today’s proclamation - we’ll continue to stand with immigrants and work to expand opportunity for all.

Andrea Ventura (@aventura71) 's Twitter Profile Photo

A very sad day for US science and innovation. We will pay a hefty price for this demagogic insanity. 90% of my lab, myself included, is made of immigrants.

Andrew Ng (@andrewyng) 's Twitter Profile Photo

New U.S. Immigration and Customs Enforcement policy regarding F-1 visa international students is horrible & will hurt the US, students, and universities. It pushes universities to offer in-person classes even when doing so is unsafe or has no pedagogical benefit, and pushes students to leave the US amid the pandemic and risk being unable to return.

PyTorch (@pytorch) 's Twitter Profile Photo

v1.6: native mixed-precision support from NVIDIA (~2x perf improvement), distributed perf improvements, a new profiling tool for memory consumption, and Microsoft committing to develop and maintain Windows PyTorch. Release Notes: github.com/pytorch/pytorc… Blog: pytorch.org/blog/pytorch-1…
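For context, a minimal sketch of the torch.cuda.amp API that shipped in v1.6; the tiny model, optimizer, and random data below are placeholders for illustration, not PyTorch's own example.

```python
# Minimal sketch of PyTorch 1.6's native mixed precision (torch.cuda.amp).
# The linear model, SGD optimizer, and random tensors are placeholders.
import torch

model = torch.nn.Linear(1024, 1024).cuda()
optimizer = torch.optim.SGD(model.parameters(), lr=1e-3)
scaler = torch.cuda.amp.GradScaler()  # scales the loss so FP16 gradients don't underflow

for _ in range(10):
    x = torch.randn(64, 1024, device="cuda")
    target = torch.randn(64, 1024, device="cuda")
    optimizer.zero_grad()
    with torch.cuda.amp.autocast():   # selected ops run in FP16, the rest stay FP32
        loss = torch.nn.functional.mse_loss(model(x), target)
    scaler.scale(loss).backward()     # backward pass on the scaled loss
    scaler.step(optimizer)            # unscales gradients, then steps the optimizer
    scaler.update()                   # adjusts the loss scale for the next iteration
```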

Greg Siskind (@gsiskind) 's Twitter Profile Photo

I'm part of the pro bono litigation effort planning to quickly file a lawsuit challenging the onerous DOL wage rule impacting H-1Bs and PERMs. We need employers, employees, and membership organizations to volunteer as plaintiffs. If interested, go to docs.google.com/forms/d/e/1FAI….

Jason Turner (@lefticus) 's Twitter Profile Photo

Find Carbon interesting? Want a modern approach to language design? WITH a compiler you can play with today? AND is prioritizing safety? AND has C++ interop? WHY haven't you looked at github.com/SerenityOS/jakt from @jntrnr and Andreas Kling ?

Dylan Patel ✈️ ICLR (@dylan522p) 's Twitter Profile Photo

If you work in AI this is the highest alpha channel out there. What are you doing anon? Binge these videos now. youtube.com/@cudamode?si=M…

Tri Dao (@tri_dao) 's Twitter Profile Photo

FlashAttention is widely used to accelerate Transformers, already making attention 4-8x faster, but has yet to take advantage of modern GPUs. We’re releasing FlashAttention-3: 1.5-2x faster on FP16, up to 740 TFLOPS on H100 (75% util), and FP8 gets close to 1.2 PFLOPS! 1/
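For context, a hedged sketch of calling a FlashAttention fused kernel from PyTorch through the flash_attn package's FA-2 style flash_attn_func entry point; the FlashAttention-3 release may expose a similar but separate interface, and the shapes, dtype, and causal flag below are illustrative assumptions.

```python
# Hedged sketch: fused attention via the flash_attn package (FA-2 style API).
# FlashAttention-3 may ship its own entry point; shapes and dtype are illustrative.
import torch
from flash_attn import flash_attn_func

batch, seqlen, nheads, headdim = 2, 4096, 16, 128
q = torch.randn(batch, seqlen, nheads, headdim, device="cuda", dtype=torch.float16)
k = torch.randn_like(q)
v = torch.randn_like(q)

# Computes softmax(q @ k^T / sqrt(headdim)) @ v without materializing the
# full seqlen x seqlen attention matrix.
out = flash_attn_func(q, k, v, causal=True)
print(out.shape)  # (batch, seqlen, nheads, headdim)
```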

Haicheng Wu (@asdf1234_0) 's Twitter Profile Photo

CUTLASS reached 5K stars this summer with 3.5M downloads per month. Thank you for your support! github.com/NVIDIA/cutlass/

Vijay (@__tensorcore__) 's Twitter Profile Photo

🔥🚨 CUTLASS Blackwell is here 🚨🔥 The 3.8 release is loaded with support for new Blackwell features, even an attention kernel 👀 Go check it out here: github.com/nvidia/cutlass Can't wait to see what y'all end up cooking with this over the next few months and years 💚

Haicheng Wu (@asdf1234_0) 's Twitter Profile Photo

CUTLASS is at the center of the CUDA Blackwell release blog. As always, we work hand in hand with the CUDA team to deliver the next level of performance. developer.nvidia.com/blog/cuda-tool…

Vijay (@__tensorcore__) 's Twitter Profile Photo

🚨🔥 CUTLASS 4.0 is released 🔥🚨 pip install nvidia-cutlass-dsl. 4.0 marks a major shift for CUTLASS: towards native GPU programming in Python. docs.nvidia.com/cutlass/media/…
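To make the "native GPU programming in Python" claim concrete, here is a hedged hello-world sketch in the CuTe DSL, reconstructed from memory of the CUTLASS 4.0 docs; treat the decorator and launch names (cute.kernel, cute.jit, kernel().launch, cutlass.cuda.initialize_cuda_context) as assumptions and check the linked docs for the authoritative example.

```python
# Hedged sketch of a CuTe DSL "hello world" after `pip install nvidia-cutlass-dsl`.
# API names are reconstructed from memory of the CUTLASS 4.0 docs; verify against
# docs.nvidia.com/cutlass before relying on them.
import cutlass
import cutlass.cute as cute

@cute.kernel
def kernel():
    tidx, _, _ = cute.arch.thread_idx()  # per-thread index, CUDA-style
    if tidx == 0:
        cute.printf("Hello from a GPU thread")

@cute.jit
def hello_world():
    cute.printf("Hello from the host-side JIT")
    kernel().launch(grid=(1, 1, 1), block=(32, 1, 1))  # one block, one warp

if __name__ == "__main__":
    cutlass.cuda.initialize_cuda_context()
    hello_world()
```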

NVIDIA HPC Developer (@nvidiahpcdev) 's Twitter Profile Photo

🎉CUTLASS 4.0 is here, bringing native #Python support for device-side kernel design, for ops like GEMM, Flash Attention, and more, powered by the new CuTe DSL. For the first time, you can write high-performance GPU kernels in Python with the same abstractions, APIs, and…

Wentao Guo (@wentaoguo7) 's Twitter Profile Photo

🦆🚀QuACK🦆🚀: a new SOL mem-bound kernel library without a single line of CUDA C++, all straight in Python thanks to the CuTe DSL. On H100 with 3 TB/s, it performs 33%-50% faster than highly optimized libraries like PyTorch's torch.compile and Liger. 🤯 With Ted Zadouri and Tri Dao
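For context on "SOL mem-bound" (speed-of-light, i.e. limited by the ~3 TB/s of HBM bandwidth on H100 rather than by compute): below is a hedged sketch of the kind of bandwidth-bound op and torch.compile baseline such kernels are benchmarked against. This is not QuACK's own API; the RMSNorm here is just an illustrative memory-bound workload.

```python
# Hedged sketch of a memory-bound op (RMSNorm) under torch.compile, i.e. the kind
# of baseline QuACK-style CuTe DSL kernels are compared against. Not QuACK's API.
import torch

def rmsnorm(x: torch.Tensor, weight: torch.Tensor, eps: float = 1e-6) -> torch.Tensor:
    # Reads x once for the mean-of-squares reduction and once for scaling:
    # on H100 this is bandwidth-bound, not compute-bound.
    rms = torch.rsqrt(x.pow(2).mean(dim=-1, keepdim=True) + eps)
    return x * rms * weight

compiled_rmsnorm = torch.compile(rmsnorm)

x = torch.randn(8192, 8192, device="cuda", dtype=torch.bfloat16)
w = torch.ones(8192, device="cuda", dtype=torch.bfloat16)
out = compiled_rmsnorm(x, w)
print(out.shape)  # torch.Size([8192, 8192])
```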
