Mercury 2 is live 🚀🚀
The world’s first reasoning diffusion LLM, delivering 5x faster performance than leading speed-optimized LLMs.
Watching the team turn years of research into a real product never gets old, and I’m incredibly proud of what we’ve built.
We’re just getting
🚨 Meta, Google DeepMind, and OpenAI all ask the same thing in ML interviews: "Implement softmax from scratch." Most candidates fail.
Someone just open sourced the training ground for it.
It's called TorchCode. LeetCode, but for PyTorch. 39 problems that test the exact skills
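For reference, the "softmax from scratch" question mentioned above usually comes down to one detail: subtracting the max before exponentiating so `exp()` doesn't overflow. A minimal sketch in plain Python (not PyTorch, to keep it dependency-free, and not taken from TorchCode itself; the function name `softmax` is just illustrative):

```python
import math

def softmax(xs):
    """Numerically stable softmax over a list of floats."""
    if not xs:
        return []
    m = max(xs)  # shift by the max so the largest exponent is exp(0) = 1
    exps = [math.exp(x - m) for x in xs]
    total = sum(exps)
    return [e / total for e in exps]
```

Without the shift, an input like `[1000.0, 1000.0]` would raise `OverflowError` in `math.exp`; with it, the result is an even split.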
I built a tiny-vllm in C++ and CUDA
- paged attention
- continuous batching
- educational
- 100% human-written™
And now I'm writing a course where you'll build your own vLLM. Still a work in progress, I'll finish by the end of April. All for free ofc, just a GitHub repo
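To give a rough idea of what "paged attention" in the list above refers to: the KV cache is split into fixed-size blocks, and each sequence keeps a block table mapping its token positions to physical blocks. The toy allocator below is a sketch of that bookkeeping only, written in Python rather than C++/CUDA; the class and method names (`PagedKVCache`, `append_token`, `free`) are hypothetical and not taken from the tiny-vllm repo.

```python
class PagedKVCache:
    """Toy block allocator: maps each sequence's tokens to fixed-size physical blocks."""

    def __init__(self, num_blocks, block_size):
        self.block_size = block_size
        self.free_blocks = list(range(num_blocks))  # ids of unused physical blocks
        self.block_tables = {}  # seq_id -> list of physical block ids
        self.seq_lens = {}      # seq_id -> number of tokens stored so far

    def append_token(self, seq_id):
        """Reserve a slot for one new token; return (physical_block, offset)."""
        n = self.seq_lens.get(seq_id, 0)
        table = self.block_tables.setdefault(seq_id, [])
        if n % self.block_size == 0:  # no block yet, or the last one is full
            if not self.free_blocks:
                raise MemoryError("no free KV blocks")
            table.append(self.free_blocks.pop())
        self.seq_lens[seq_id] = n + 1
        return table[n // self.block_size], n % self.block_size

    def free(self, seq_id):
        """Return all blocks of a finished sequence to the free pool."""
        self.free_blocks.extend(self.block_tables.pop(seq_id, []))
        self.seq_lens.pop(seq_id, None)
```

Because blocks are handed out on demand and returned as soon as a sequence finishes, a scheduler can admit new requests between decoding steps, which is the basic idea behind continuous batching.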