
Casey Chu
@caseychu9
Researcher at @openai
ID: 902411487758581765
https://caseychu.io 29-08-2017 06:03:59
241 Tweet
3,3K Takipçi
655 Takip Edilen




I was leading long context in Gemini for a while now, and today I’m proud to share what the team has achieved: over 1M context in a large-scale foundation model. Big shoutout to Denis Teplyashin and Machel Reid - without you this would not have happened! youtube.com/watch?v=wa0MT8…

Excited to share what Bill Peebles Tim Brooks and my team has been working on for the past year! Our text-to-video model Sora can generate videos of complex scenes up to a minute long. We're excited about making this step toward AI that can reason about the world like we do.




justice for Barret Zoph 🪵

GPT-4o would not have happened without the vision, talent, conviction, and determination of Prafulla Dhariwal over a long period of time. that (along with the work of many others) led to what i hope will turn out to be a revolution in how we use computers.

I had the joy and the honor of being invited to give the Harvey Mudd College commencement address this year. In the vector space of all advice, I explore a 5-dimension subspace orthogonal to the “follow your dreams” vector. YouTube Link: youtu.be/W3I3kAg2J7w

An intuition for relative memory access times (scaled 10^10): Reg: 2 sec - Take from shelf Cache: 6½ min - Get from garage DDR Main: 20 min - Go to store DDR CXL: 1hr Far Mem: 8hr SSD: 6 days - Order online Spinning Disk (3ms): 1yr! Via Dylan Patel & SemiAnalysis





LLMs have complex joint beliefs about all sorts of quantities. And my postdoc James Requeima visualized them! In this thread we show LLM predictive distributions conditioned on data and free-form text. LLMs pick up on all kinds of subtle and unusual structure: 🧵


