mohit (@mohitwt_)'s Twitter Profile
mohit

@mohitwt_

19 • dl • stay hard

ID: 1929833932427874304

Link: https://pluto0.streamlit.app/ · Joined: 03-06-2025 09:35:02

1.1K Tweets

244 Followers

122 Following

mohit (@mohitwt_):

wrote my first CUDA, a simple c = a + b:

what it's doing:
> allocate CPU mem and fill with input data
> allocate GPU mem
> copy input data CPU to GPU
> gpu kernel computes
> copy result back gpu to cpu
> print results from CPU
> free GPU mem
😭
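the steps above map almost one-to-one onto host code. a minimal sketch of that first program (illustrative, not the exact code from the tweet):

```cuda
// Minimal CUDA vector add: c = a + b, one element per thread.
#include <cstdio>
#include <cuda_runtime.h>

__global__ void vecAdd(const float* a, const float* b, float* c, int n) {
    int i = blockIdx.x * blockDim.x + threadIdx.x;  // global thread index
    if (i < n) c[i] = a[i] + b[i];                  // guard against overshoot
}

int main() {
    const int N = 1 << 10;
    size_t bytes = N * sizeof(float);

    // allocate CPU mem and fill with input data
    float *h_a = new float[N], *h_b = new float[N], *h_c = new float[N];
    for (int i = 0; i < N; ++i) { h_a[i] = i; h_b[i] = 2 * i; }

    // allocate GPU mem
    float *d_a, *d_b, *d_c;
    cudaMalloc(&d_a, bytes); cudaMalloc(&d_b, bytes); cudaMalloc(&d_c, bytes);

    // copy input data CPU -> GPU
    cudaMemcpy(d_a, h_a, bytes, cudaMemcpyHostToDevice);
    cudaMemcpy(d_b, h_b, bytes, cudaMemcpyHostToDevice);

    // gpu kernel computes
    vecAdd<<<(N + 255) / 256, 256>>>(d_a, d_b, d_c, N);

    // copy result back GPU -> CPU, then print from CPU
    cudaMemcpy(h_c, d_c, bytes, cudaMemcpyDeviceToHost);
    printf("c[1] = %f\n", h_c[1]);

    // free GPU mem
    cudaFree(d_a); cudaFree(d_b); cudaFree(d_c);
    delete[] h_a; delete[] h_b; delete[] h_c;
}
```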
mohit (@mohitwt_):

matrix addition in CUDA, the first picture is launching 1 block of NxN threads.
~ each thread computes one element of the matrix

> dim3 threadsPerBlock(N,N);
> MatAdd<<<1, threadsPerBlock>>>(d_A, d_B, d_C, N);

this works fine for small matrices; i used N=4, which is only 16 threads (a single block caps out at 1024 threads, so large N needs a grid of blocks)
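a sketch of the kernel behind that launch, assuming d_A/d_B/d_C are device buffers already allocated and filled as in the vector-add example (illustrative, not the exact code from the pictures):

```cuda
// C = A + B with one block of N x N threads, one thread per element.
__global__ void MatAdd(const float* A, const float* B, float* C, int N) {
    int col = threadIdx.x;  // x index within the block
    int row = threadIdx.y;  // y index within the block
    C[row * N + col] = A[row * N + col] + B[row * N + col];
}

// launch config from the tweet: a single block of N x N threads
// dim3 threadsPerBlock(N, N);
// MatAdd<<<1, threadsPerBlock>>>(d_A, d_B, d_C, N);
```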
Raj Nair (@rajnair06):

(1/n)
Building Learnflow
-implemented throttling for Free/Premium users
10/day for free and 1000/day for premium (for the sake of testing)
-cached monthly and weekly progress summaries
-benchmark tests for cached vs non-cached responses, insane diff
more info below
Raj Nair (@rajnair06):

Building LearnFlow
-learnt about background jobs and queues
-used Celery as the task queue system and Redis as the message broker
-offloaded the goal-inactivity reminder email task to the background through this
more details below
mohit (@mohitwt_):

A Detailed Explanation of CUDA Thread Hierarchy
(Threads, Blocks, and Grids):

A CUDA thread is the smallest unit of execution on the GPU, similar to a CPU thread but designed for massive parallelism. each thread has its own registers and runs the same kernel code independently,
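the hierarchy shows up in code through three built-in variables: threadIdx (position within the block), blockIdx (position within the grid), and blockDim (threads per block). a tiny illustrative kernel (not from the original thread):

```cuda
// Each thread combines its block and thread coordinates into a unique
// global index, even though every thread runs this same kernel code.
__global__ void whoAmI(int* global_ids) {
    int gid = blockIdx.x * blockDim.x + threadIdx.x;  // unique across the grid
    global_ids[gid] = gid;
}

// e.g. whoAmI<<<4, 256>>>(d_ids); launches a grid of 4 blocks x 256 threads,
// so gid runs from 0 to 1023.
```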
mohit (@mohitwt_):

update on my framework:

> started implementation of the Autograd Engine:
the autograd engine has a few key tasks:
track tensor dependencies, compute gradients automatically, and backpropagate through operations.

> added a computation graph to the framework:
forward pass works for
mohit (@mohitwt_):

A short post on CUDA Programming Model:

the CUDA programming model is the way cuda lets developers actually write programs for massively parallel processors (hardware that runs thousands of threads at the same time)

according to the NVIDIA CUDA c++ guide, cuda is built around three
mohit (@mohitwt_):

made a medium account where i’ll be sharing detailed explanations about my framework. i’ll keep posting small progress updates on X, things like new features i added or quick milestones, but on medium, i’ll go deep into the actual implementation.

i’ll be breaking down how each
mohit (@mohitwt_):

ran a distilgpt2 model locally, tested with different prompts, tweaked generation parameters to observe variations in output. I'll share more on this tmr, gn.