Jagged Intelligence
The word I came up with to describe the (strange, unintuitive) fact that state of the art LLMs can both perform extremely impressive tasks (e.g. solve complex math problems) while simultaneously struggle with some very dumb problems.
E.g. example from two
🚀 New Year New Name: NOXUS
Inspired by the ambition and vision of the Noxus organization from League of Legends, Noxus delivers enterprise-ready AI intelligence and automation without building from the ground up.
The only AI platform your organization will ever need. #Noxus
Deepseek hitting back against OpenAI and besting them at their own game while being the actually Open company wasn't on my bingo card for 2025, I'll be honest.
The most exciting thing about DeepSeek R1 isn’t that it’s open-source or matches OpenAI’s o1 in reasoning tasks—or even that it’s 90–95% cheaper.
The real breakthrough? It proves that LLMs can improve reasoning through pure reinforcement learning, without massive CoT datasets.
We've been migrating some production workloads to R1. It's surprising how it's clearly beating o1 and o1-mini in real use cases for us. It's not a benchmark warrior
The new smaller SmolVLM models just dropped, so ofc we had to train a ColPali version for them!
Introducing the ColSmol family: the 500M model can retrieve documents with higher accuracy compared to the original ColPali checkpoint with about 6x less weights 🚀 (1/4 🧵)
It’s done because it’s much easier to 1) collect, 2) evaluate, and 3) beat and make progress on. We’re going to see every task that is served neatly packaged on a platter like this improved (including those that need PhD-grade expertise). But jobs (even intern-level) that need
Unbelievable results, feels like a dream—our R1 model is now #1 in the world (with style control)! 🌍🏆 Beyond words right now. 🤯 All I know is we keep pushing forward to make open-source AGI a reality for everyone. 🚀✨ #OpenSource #AI #AGI #DeepSeekR1