Jeff (@weekeypedia) 's Twitter Profile
Jeff

@weekeypedia

Building smolaitools.site

ID: 124228928

Joined: 18-03-2010 17:55:28

1.1K Tweets

322 Followers

855 Following

Hussein Nasser (@hnasr) 's Twitter Profile Photo

Did you know that the server certificate returned in the TLS server hello can be large? 

I have seen ones up to 10KB, especially when the full chain is included. This can slow down the handshake, especially when latency is high. 

Ways to address this: 

- Compress the certificate
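
Certificate compression is standardized in RFC 8879 for TLS 1.3. A minimal sketch of why it helps, using zlib on a hypothetical stand-in blob rather than a real chain:

```python
import zlib

# Hypothetical stand-in for a ~10KB PEM certificate chain. Real chains
# also compress well, since DER/PEM certs repeat long structures
# (issuer names, OIDs, extension layouts, base64 padding).
chain = (b"-----BEGIN CERTIFICATE-----\n"
         + b"MIIFazCCA1OgAwIBAgIRAIIQz7DSQONZRGPgu2OCiwAwDQYJ\n" * 60
         + b"-----END CERTIFICATE-----\n") * 3

compressed = zlib.compress(chain, level=9)
print(f"{len(chain)} -> {len(compressed)} bytes")
```

In actual TLS, the `compress_certificate` extension negotiates zlib, brotli, or zstd; the sketch only illustrates the size win that saves round-trip bytes on a high-latency link.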
Nassim Nicholas Taleb (@nntaleb) 's Twitter Profile Photo

ChatGPT is the modern version of Flaubert's "Dictionary of Received Ideas" (Dictionnaire des idées reçues), that is, a powerful cliché parroting engine. And, as they say in trading: "what most people know isn't worth knowing."

Danielle Fong 🔆 (@daniellefong) 's Twitter Profile Photo

good summary about the floaty rock drama so far

“if you’re not following LK99, you’re missing out on the most fun thing happening on the internet now. Feels like the old internet.”

 reddit.com/r/redscarepod/…
Andrej Karpathy (@karpathy) 's Twitter Profile Photo

These 94 lines of code are everything that is needed to train a neural network. Everything else is just efficiency.

This is my earlier project Micrograd. It implements a scalar-valued autograd engine. You start with some numbers at the leaves (usually the input data and the
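
The core idea can be sketched in a few lines (a toy in the spirit of Micrograd, not the actual 94-line file): each operation records its inputs and a closure that applies the chain rule.

```python
class Value:
    """Minimal scalar autograd node: wraps a number, remembers how it
    was computed, and can backpropagate gradients to its inputs."""
    def __init__(self, data, _children=()):
        self.data = data
        self.grad = 0.0
        self._backward = lambda: None
        self._prev = set(_children)

    def __add__(self, other):
        other = other if isinstance(other, Value) else Value(other)
        out = Value(self.data + other.data, (self, other))
        def _backward():          # d(out)/d(self) = d(out)/d(other) = 1
            self.grad += out.grad
            other.grad += out.grad
        out._backward = _backward
        return out

    def __mul__(self, other):
        other = other if isinstance(other, Value) else Value(other)
        out = Value(self.data * other.data, (self, other))
        def _backward():          # product rule
            self.grad += other.data * out.grad
            other.grad += self.data * out.grad
        out._backward = _backward
        return out

    def backward(self):
        # topological sort, then run the chain rule output-to-leaves
        topo, visited = [], set()
        def build(v):
            if v not in visited:
                visited.add(v)
                for c in v._prev:
                    build(c)
                topo.append(v)
        build(self)
        self.grad = 1.0
        for v in reversed(topo):
            v._backward()

x, y = Value(2.0), Value(3.0)
z = x * y + x        # dz/dx = y + 1 = 4, dz/dy = x = 2
z.backward()
print(x.grad, y.grad)
```

Everything a real framework adds on top of this (tensors, GPU kernels, fused ops) is, as the tweet says, efficiency rather than new capability.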
Sky News (@skynews) 's Twitter Profile Photo

Kenyans see fellow protesters killed in front of them but they choose to keep fighting. Read the eyewitness report from Sky's Africa correspondent Yousra Elbagir: trib.al/SkdPUtV

@bluecow 🐮(schizo) (@bluecow009) 's Twitter Profile Photo

I just opensourced something I have been working on for months.

I call it “super prompt” because it also allows some LLMs (Claude) to come up with really novel ideas (the picture is an example; the full prompt is larger).

It's built in an XML agent format, btw. 

Github in comments.
Rohan makes compilers better 🛠️🚀 (@rohan_devarc) 's Twitter Profile Photo

Main memory workloads eventually end up bottlenecked by DRAM throughput, which shows up in perf metrics as CPU memory stalls. A master class in performance engineering! valkey.io/blog/unlock-on…

Peter Kraft (@petereliaskraft) 's Twitter Profile Photo

Every day on YouTube, people upload 4 million videos and watch 5 billion videos. Handling this staggering traffic requires a vast fleet of servers. So when a new request comes in, where does it go? How do you balance load across so many servers for so many jobs?

I love this
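
One classic answer at this scale is the "power of two choices": sample two servers at random and send the job to the less loaded one. A sketch of the technique (an illustration of the general idea, not a claim about YouTube's actual balancer):

```python
import random

def pick_server(loads, rng):
    """Power-of-two-choices: sample two random servers and route the
    job to whichever currently has fewer queued jobs."""
    a, b = rng.sample(range(len(loads)), 2)
    return a if loads[a] <= loads[b] else b

rng = random.Random(0)        # fixed seed so the demo is repeatable
loads = [0] * 100             # 100 servers, each tracking queued jobs
for _ in range(10_000):
    loads[pick_server(loads, rng)] += 1

print(max(loads), min(loads))  # spread stays tight vs. pure random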
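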
steve (@stevenpwalsh) 's Twitter Profile Photo

Moon One of the things we learned with Anthropics circuits work, is how the model explains it's thought process is not always what it's thought process actually was. Probably just something to keep in mind.

Yu Wang (@__yuwang__) 's Twitter Profile Photo

Introducing The Most Advanced Memory System for LLM Agents MIRIX is by far the most advanced memory system in the world, designed to make AI truly remember, learn, and help you over time. Website: mirix.io Paper: arxiv.org/abs/2507.07957 Github:

Quentin Anthony (@quentinanthon15) 's Twitter Profile Photo

I was one of the 16 devs in this study. I wanted to speak on my opinions about the causes and mitigation strategies for dev slowdown.

I'll say as a "why listen to you?" hook that I experienced a -38% AI-speedup on my assigned issues. I think transparency helps the community.
AVB (@neural_avb) 's Twitter Profile Photo

This is an awesome article. The best part is their note to “build around the KV cache”.

If your system prompt remains consistent, your tools remain constant, and you always append to the conversation JSON… you will hit the KV cache often, cutting down cost and latency.
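
The reason appending wins: the cache can only reuse the tokens up to the first point where two prompts diverge. A toy sketch (whitespace "tokenization" for illustration; real caching operates on model tokens):

```python
def shared_prefix_tokens(a, b):
    """Count the leading tokens two prompts share -- the part a KV
    cache can reuse between requests (toy whitespace tokenizer)."""
    n = 0
    for x, y in zip(a.split(), b.split()):
        if x != y:
            break
        n += 1
    return n

system = "You are a helpful assistant. Tools: search calc"
turn1 = system + " User: hi Assistant: hello"
turn2 = turn1 + " User: what is 2+2"                         # append-only
turn2_edited = system.replace("calc", "code") + " User: hi"  # prefix changed

print(shared_prefix_tokens(turn1, turn2))         # all of turn1 reusable
print(shared_prefix_tokens(turn1, turn2_edited))  # reuse stops at the edit
```

Editing anything early in the prompt (system text, tool definitions) invalidates every cached position after it, which is exactly why the advice is to keep those stable.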
Victoria Slocum (@victorialslocum) 's Twitter Profile Photo

When should you chunk your documents: before embedding, or after querying?

Most RAG systems use 𝗽𝗿𝗲-𝗰𝗵𝘂𝗻𝗸𝗶𝗻𝗴 - the standard approach where you break documents into smaller pieces first, then embed and store them in your vector database. This requires upfront decisions
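
The "upfront decisions" are things like chunk size and overlap, fixed before anything is embedded. A character-based sketch of pre-chunking (real splitters usually work on tokens or sentence boundaries):

```python
def pre_chunk(text, size=200, overlap=50):
    """Fixed-size pre-chunking with overlap: every splitting decision
    is made before embedding or storing anything."""
    chunks, start = [], 0
    while start < len(text):
        chunks.append(text[start:start + size])
        start += size - overlap   # overlap keeps context across cuts
    return chunks

doc = "word " * 500               # stand-in 2,500-character document
chunks = pre_chunk(doc)
print(len(chunks), len(chunks[0]))
```

Each chunk then gets embedded and stored as its own vector; the trade-off is that a bad size/overlap choice is baked into the index until you re-embed everything.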
Ahmad (@theahmadosman) 's Twitter Profile Photo

last month, Karpathy dropped the ULTIMATE guide to speed-running your way into LLMs  

in this project, you’ll build all the essentials, all under 8k lines of code

> train the tokenizer, new rust implementation
> pretrain a transformer LLM on fineweb  
> evaluate CORE score
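
The first step, tokenizer training, is classic byte-pair encoding. One merge step of the algorithm can be sketched in Python (a toy illustration of the idea, not the project's Rust implementation):

```python
from collections import Counter

def bpe_merge_step(tokens):
    """One BPE training step: find the most frequent adjacent pair and
    fuse it into a single new token. Training repeats this until a
    vocabulary-size budget is reached."""
    pairs = Counter(zip(tokens, tokens[1:]))
    if not pairs:
        return tokens
    (a, b), _ = pairs.most_common(1)[0]
    merged, i = [], 0
    while i < len(tokens):
        if i + 1 < len(tokens) and (tokens[i], tokens[i + 1]) == (a, b):
            merged.append(a + b)  # replace the pair with the fused token
            i += 2
        else:
            merged.append(tokens[i])
            i += 1
    return merged

toks = list("banana banana")      # start from individual characters
for _ in range(3):                # a few merges grow multi-char tokens
    toks = bpe_merge_step(toks)
print(toks)
```

Merging never changes the underlying text, only how it is grouped, which is why the token sequence always decodes back to the original string.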