William Fedus (@liamfedus) 's Twitter Profile
William Fedus

@liamfedus

Past: VP of Post-Training @OpenAI; Google Brain

ID: 885528008

linkhttp://acsweb.ucsd.edu/~wfedus/ calendar_today16-10-2012 23:14:47

1,1K Tweet

24,24K Takipçi

1,1K Takip Edilen

William Fedus (@liamfedus) 's Twitter Profile Photo

I have yet to find a well-defined task that cannot be optimized by these models. Eval improvement like ARC AGI showcase this dynamic

William Fedus (@liamfedus) 's Twitter Profile Photo

Reasoning has begun to deliver us better models like o1, o3, o3-mini, but the genuine unlock will be agents. Reasoning gives us better planning, tool-use, error recovery and I’m thrilled for this year. 2025 is the year of agents. Congrats team!!

Aidan Clark (@_aidan_clark_) 's Twitter Profile Photo

o3-mini's intelligence x speed combo is incredible, idk what to say other than just try it and see for yourself. This took 8 seconds, how long would it take you?

William Fedus (@liamfedus) 's Twitter Profile Photo

All free ChatGPT users now have reasoning models with o3-mini. The cost-intelligence frontier is shifting fast (o3-mini outperforms even o1 on many STEM evals!)

Hongyu Ren (@ren_hongyu) 's Twitter Profile Photo

We released o3-mini today! Everyone can use it for free. It reasons hard, reasons fast, searches the web, and most importantly, knows research. Ask the model hard questions and brainstorm with it!

William Fedus (@liamfedus) 's Twitter Profile Photo

We already have a quick pace of improvement on challenging benchmarks (4o at 3%, o1 at 9%, deep research at 27% in humanity's last exam), but expect further acceleration as AI becomes an even larger contributor to future AI development

Jason Wei (@_jasonwei) 's Twitter Profile Photo

Very excited to finally share OpenAI's "deep research" model, which achieves twice the score of o3-mini on Humanity's Last Exam, and can even perform some tasks that would take PhD experts 10+ hours to do! A few thoughts on the implications: Deep research can be seen as a new

Very excited to finally share OpenAI's "deep research" model, which achieves twice the score of o3-mini on Humanity's Last Exam, and can even perform some tasks that would take PhD experts 10+ hours to do!

A few thoughts on the implications: Deep research can be seen as a new
Zhiqing Sun (@edwardsun0909) 's Twitter Profile Photo

Excited to finally share what I’ve been working on since joining OpenAI last June! The goal of deep-research is to enable reasoning models with tools to tackle long-horizon tasks in the real world and discover new knowledge. It’s a highly autonomous agent—hand it a hard problem,

Sam Altman (@sama) 's Twitter Profile Photo

congrats to the team, especially Isa Fulford and Zhiqing Sun, for building an incredible product. my very approximate vibe is that it can do a single-digit percentage of all economically valuable tasks in the world, which is a wild milestone.

William Fedus (@liamfedus) 's Twitter Profile Photo

We're expanding deep research to more users today! The initial feedback from our Pro users has been incredible (thanks!) and we think deep research is an excellent demonstration of how reasoning models unlock reliable agents. Next, we will continue to expand the reach of this

William Fedus (@liamfedus) 's Twitter Profile Photo

Vibes are key to get right especially in subjective areas of AI products and we’re always talking about this. Love to see the vibes level-up here that goes beyond a basic try-on for new outfits

William Fedus (@liamfedus) 's Twitter Profile Photo

As AI capability continues to improve and becomes ubiquitous, a differentiator of products will come from effectively making contact with their industry and solving their specific problems. Congrats Mirror Mirror for doing this and nailing fashion aesthetics in image generation!