ilge (@ilge) 's Twitter Profile
ilge

@ilge

Research @OpenAI | there is as yet insufficient data for a meaningful answer.

ID: 250500792

linkhttp://ilge.github.io calendar_today11-02-2011 06:17:50

486 Tweet

3,3K Followers

299 Following

Ahmed El-Kishky (@ahelkky) 's Twitter Profile Photo

My colleagues and I will be hosting a talk and Q&A session on 'Learning to Reason with LLMs' and the new OpenAI o1 model. Join us for an insightful discussion! forum.openai.com/public/events/
 #OpenAIForum

Sonya Huang đŸ„ (@sonyatweetybird) 's Twitter Profile Photo

Our most exciting episode of Training Data yet 🍓🍰 OpenAI’s o1 represents a major leap forward by giving models time to "think." Inference-time compute is the next big research frontier. Thrilled to have Noam Brown, ilge, and hunter on the show Pat Grady Sequoia Capital

OpenAI Developers (@openaidevs) 's Twitter Profile Photo

Introducing canvas—your coding surface in ChatGPT. ✏ Edit code inline 🐛 Review code and fix bugs 💬 Add logs and comments 🚱 Port to different languages We’ll be adding more to canvas over time. ChatGPT Plus and Team users can try the beta starting today.

ilge (@ilge) 's Twitter Profile Photo

I followed the trend of asking ChatGPT: “out of all the data you have on me, generate an image that captures me the way you see me”. 💓

I followed the trend of asking ChatGPT: “out of all the data you have on me, generate an image that captures me the way you see me”.  💓
François Chollet (@fchollet) 's Twitter Profile Photo

Today OpenAI announced o3, its next-gen reasoning model. We've worked with OpenAI to test it on ARC-AGI, and we believe it represents a significant breakthrough in getting AI to adapt to novel tasks. It scores 75.7% on the semi-private eval in low-compute mode (for $20 per task

Today OpenAI announced o3, its next-gen reasoning model. We've worked with OpenAI to test it on ARC-AGI, and we believe it represents a significant breakthrough in getting AI to adapt to novel tasks.

It scores 75.7% on the semi-private eval in low-compute mode (for $20 per task
Chelsea Sierra Voss (@csvoss) 's Twitter Profile Photo

don’t miss this part of today’s 12th Day of OpenAI: “Deliberative Alignment,” exciting work by the illustrious Melody Guan ʕᔔᎄᔔʔ et al! the technique achieves a Pareto improvement over previous approaches such as RLHF, and reduces overrefusals! openai.com/index/delibera


Nat McAleese (@__nmca__) 's Twitter Profile Photo

Epoch AI are going to publish more details, but on the OpenAI side for those interested: we did not use FrontierMath data to guide the development of o1 or o3, at all. (1/n)

OpenAI (@openai) 's Twitter Profile Photo

OpenAI o3-mini is now available in ChatGPT and the API. Pro users will have unlimited access to o3-mini and Plus & Team users will have triple the rate limits (vs o1-mini). Free users can try o3-mini in ChatGPT by selecting the Reason button under the message composer.

Alexander Wei (@alexwei_) 's Twitter Profile Photo

1/N I’m excited to share that our latest OpenAI experimental reasoning LLM has achieved a longstanding grand challenge in AI: gold medal-level performance on the world’s most prestigious math competition—the International Math Olympiad (IMO).

1/N I’m excited to share that our latest <a href="/OpenAI/">OpenAI</a> experimental reasoning LLM has achieved a longstanding grand challenge in AI: gold medal-level performance on the world’s most prestigious math competition—the International Math Olympiad (IMO).
Aleksander Holynski (@holynski_) 's Twitter Profile Photo

Another one. Already a powerful painting, but moving around it yourself gives a totally different feeling. Jacques Louis David's "The Death of Socrates" => #Genie3

Sheryl Hsu (@sherylhsu02) 's Twitter Profile Photo

2/n We officially competed in the online AI track of the IOI, where we scored higher than all but 5 (of 330) human participants and placed first among AI participants. We had the same 5 hour time limit and 50 submission limit as human participants. Like the human contestants, our

2/n We officially competed in the online AI track of the IOI, where we scored higher than all but 5 (of 330) human participants and placed first among AI participants. We had the same 5 hour time limit and 50 submission limit as human participants. Like the human contestants, our