t0mcruzz (@getvalver) Twitter Tweets • TwiCopy

Jeff Atwood

@codinghorror

4 years ago

Let's work together to update the single most influential book of the BASIC era! blog.codinghorror.com/updating-the-s…

Catherine

@whitequark

2 years ago

it's happening!!

Jim Keller

@jimkxa

2 years ago

Moore's law is still not dead!! My guess 1000x to go on known physics. Probably not EUV

will whang🌻

@will_whang

2 years ago

oshwhub.com/malong/PEX8796… Holy shit........Σヽ(ﾟД ﾟ; )ﾉ PEX8796 96-Lane PCIe3.0 Opensource PCIe card!

Dmitry Jemerov

@intelliyole

2 years ago

A big interview with me (in Russian) has just been published: youtube.com/watch?v=8f-YLC…

James Lin

@jlinbio

2 years ago

Halfway through deciphering machine god

John Carmack

In the 90s, there were a dozen companies making graphics accelerators, and Nvidia wasn’t initially a clear winner. Their first product was terrible, and 3DFX, 3DLabs, Rendition, and others all had important pieces of the puzzle earlier. However, they relentlessly improved and

Douglas Mun

@douglasmun

2 years ago

*Evolution of click farm fraud.* 1st generation click farm fraud, fully manual labour.

абстрактный мужик

@abstract_artem

2 years ago

Storytime, prompted by the insane xz vulnerability introduced through modified build scripts. Anyway, around 2017, while digging into JVM build systems for work (Gradle, Buck, Bazel, Maven), it dawned on me that Gradle differs from all of them in how it hooks up Java annotation processors, which are an API

Andrej Karpathy

@karpathy

2 years ago

Congrats to AI at Meta on Llama 3 release!! 🎉 ai.meta.com/blog/meta-llam… Notes: Releasing 8B and 70B (both base and finetuned) models, strong-performing in their model class (but we'll see when the rankings come in @ LMSYS Org :)) 400B is still training, but already encroaching

Sebastian Raschka

@rasbt

2 years ago

If you are looking for something to code & read this weekend, I uploaded a notebook to finetune a small GPT model to classify SPAM messages with ~96% accuracy: github.com/rasbt/LLMs-fro… (Fun fact: it's small enough to train it on your laptop; ~5 min on my M3 MacBook Air!)

Vrushank Desai

@vrushankdes

2 years ago

I spent a couple months at the beginning of this year learning about GPU programming through trying to optimize inference for Cheng Chi's awesome Diffusion Policy paper. I was able to improve inference time for the denoising U-Net by ~3.4x over PyTorch eager mode and ~2.65x over

the tiny corp

@__tinygrad__

2 years ago

.AMD @amdradeon released some MES documentation today! (it's on GPUOpen)

A good start, but we are bypassing the MES now in our "AMD" backend. We are even bypassing most of the MEC.

Can you document the PM4 packets and what happens after you poke COMPUTE_DISPATCH_INITIATOR?

Tom Yeh

@proftomyeh

2 years ago

Transformer by Hand✍️ To study the transformer architecture, it is like opening up the hood of a car and seeing all sorts of engine parts: embeddings, positional encoding, feed-forward network, attention weighting, self-attention, cross-attention, multi-head attention, layer

Alex Reibman 🖇️

@alexreibman

2 years ago

Apparently we were #1 on Hacker News today??

Tom Yeh

@proftomyeh

2 years ago

llm.c by Hand✍️ C programming + matrix multiplication by hand This combination is perhaps as low as we can get to explain how the Transformer works. Special thanks to Andrej Karpathy for encouraging early feedback and tetsuo - cRc for helping me understand the pragma magic. I hope

Hasen Judi

@hasen_judi

a year ago

Your career will be derailed for a decade if you go this route

Kuter Dinel

@kuterdinel

a year ago

Here is RTX4090 ISA Spec Please retweet kuterdinel.com/nv_isa_sm89/ Accidentally deleted the last tweet🫠

Tagir Valeev

@tagir_valeev

a year ago

Why it's good to have kids. 1. You hug your kid and they're warm. 2. You can take your kid to the playground, spin on the merry-go-round, and go down the slide. 3. There's always someone at home to play board games with, so there's no need to invite anyone over. 4. They're funny.

Anthony Bonato

@anthony_bonato

a year ago

In honor of Taylor Swift's recent 35th birthday, here are 35 Taylor series
