EstarozaElLimbi (@cppchedy)'s Twitter Profile
a Muslim, anime, manga, C++, Compiler Explorer.

ID: 891295719268765696

Joined: 29-07-2017 13:53:53

1.1K Tweets

103 Followers

596 Following

Akshay 🚀 (@akshay_pachaar)

Serve 1000s of fine-tuned LLMs on a Single GPU!

LoRAX by Predibase allows users to serve thousands of fine-tuned models on one GPU, reducing costs without compromising speed or performance.

(100% open-source)
Jonathan Gorard (@getjonwithit)

This is a surprisingly subtle problem, since e.g. the floating-point representations in C code satisfy commutativity of addition/multiplication, but not associativity. So we needed to define a bespoke algebra of "correctness-preserving transformations" for C programs. (8/10)
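
The property mentioned in the post can be checked directly. The snippet below is a minimal sketch in Python rather than C, on the assumption that Python's `float` is an IEEE 754 double (as C's `double` is on common platforms); the values `a`, `b`, and `c` are illustrative choices, not taken from the thread. Commutativity is checked only for ordinary (non-NaN) values.

```python
# Illustrative values: 1e16 lies above 2**53, where adjacent doubles are 2 apart,
# so adding 1.0 to -1e16 is lost to rounding.
a, b, c = 1e16, -1e16, 1.0

# Swapping the operands of a single addition gives the same rounded result.
commutative = (a + c) == (c + a)

# Regrouping three operands changes which intermediate result gets rounded.
left = (a + b) + c    # (1e16 - 1e16) + 1.0
right = a + (b + c)   # 1e16 + (-1e16 + 1.0), where the inner sum rounds back to -1e16
associative = left == right

print(commutative, associative, left, right)
```

A compiler that regroups `(a + b) + c` into `a + (b + c)` would therefore change the result of this computation, which is why such rewrites cannot be treated as automatically correctness-preserving.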

Pramod Goyal (@goyal__pramod)

The best way to learn PyTorch is by building something with it.

Consider checking out my blog, where I help you build transformers from scratch using PyTorch.
🔥 Matt Dancho (Business Science) 🔥 (@mdancho84)

Correlation is the skill that has singlehandedly benefitted me the most in my career. 

In 3 minutes I'll demolish your confusion (and share strengths and weaknesses you might be missing).

Let's go:
jack morris (@jxmnop)

# A new type of information theory

this paper is not super well-known but has changed my opinion of how deep learning works more than almost anything else

it says that we should measure the amount of information available in some representation based on how *extractable* it is,
Shalini Goyal (@goyalshaliniuk)

Building a system that scales isn’t just about picking the right database - it’s about mastering the full stack of scalability.

This powerful visual breaks down the 7 critical layers of scalable system design, from the UI to the infrastructure.

Here’s what each layer brings to
alphaXiv (@askalphaxiv)

Introducing GPT-4.1 for understanding arXiv papers 🚀 Highlight any section of a paper to ask questions and “@” other papers to quickly add to context and compare results, benchmarks, etc.