BREAKING🚨 So, I tested this new LLM-based system. It generated this 200-page report I didn't read and then this 150-page book I didn't read either, and then a 20-page travel plan I didn't verify.
All I can say: it's very, very impressive! 🔥🚀
First, the number of pages it
For economists, the tariff spectacle currently coming out of the White House econ team is akin to watching NASA being taken over by astrologers and repurposing the International Space Station to charge magic healing crystals with moonbeams.
Ok, big reveal for the Odyssey experiment:
A clear majority (48.4%) preferred translation D, which was done by... GPT4o.
The others were Emily Wilson (A, 22.6%), Lattimore (B, 11%) and Fitzgerald (C, 18.1%).
Chinese firm offers high-performance, low-cost satellites to belt and road countries buff.ly/jzLQ9TH Breakthrough makes powerful sub-metre Kuanfu 02B the lightest on the market, Chang Guang Satellite Technology says.
DOGE said 40% of phone calls into Social Security centers were fraud, so it built a tool to track it. Turns out the 40% was actually 0.0018% and the tool slowed down processing significantly.
🚨 BREAKING: The reason o3 and o3-pro are insanely cheap?
OpenAI secretly used Codex as an internal agentic software engineer to brutally optimize inference costs.
We’re talking surgical-level code refinement, system-wide efficiency gains, done by AI coding AI.
Source?
Someone at Novo Nordisk failed to pay a $450 maintenance fee, which would have kept its patent on Ozempic in force for another two years...
A very expensive mistake!
Okay I read the MIT "Your Brain on ChatGPT" preprint. (well, it's 140 pages long, so I read about 1/3 of it and skimmed the boring parts)
Here are my takeaways: 🧵
The thing I noticed during the blockchain craze is many of the people who were very excited about blockchain seemed to not actually know about databases. They were like, “imagine: a digital record of every transaction” as though that hadn’t already existed for 40 years.
Humanity has prevailed (for now!)
I'm completely exhausted. I figured, I had 10h of sleep in the last 3 days and I'm barely alive.
I'll post more about the contest when I get some rest.
(To be clear, those are provisional results, but my lead should be big enough)
1/N I’m excited to share that our latest OpenAI experimental reasoning LLM has achieved a longstanding grand challenge in AI: gold medal-level performance on the world’s most prestigious math competition—the International Math Olympiad (IMO).
A striking thing about OpenAI's IMO gold math model is how terse it is, it really tries to express itself in single tokens. Often breaking the rules of grammar and spelling to do so. They say compression is intelligence. We may be seeing a totally novel way to do compression