I have always believed that you don't need a GPT-6-quality base model to achieve human-level reasoning performance, and that reinforcement learning was the missing ingredient on the path to AGI.
Today, we have the proof -- o1.
x.com/OpenAI/status/…
o1 is not GPT. your prompts will not "just work".
you'll have to experiment to find where it shines. you'll likely find ways to use it that even we don't know about.
go play!
ty for all the feedback so far! few notes:
1. yes we went out to 100% of plus users yesterday. this is my preferred way of shipping, even if there’s some risk demand is overwhelming
2. doing what we can about rate limits! for now we reset them because there’s so much enthusiasm
o1 prompting is alien to me. Its thinking, gloriously effective at times, is also dreamlike and unamenable to advice.
Just say what you want and pray. Any notes on “how” will be followed with the diligence of a brilliant intern on ketamine.
everyone glorifies silence and lack of distractions as the only way to focus and get things done. skill issue. you should be cranking code for 3 different projects while on a large coffee listening to industrial noise pop at 260 bpm