Josh (@josh_bickett)'s Twitter Profile
Josh

@josh_bickett

Sharing AI insights for coders | Lead eng @hyperwriteai @othersideai | joshbickett.com | seeking additionality

ID: 761747406513766402

Link: https://www.willpayforthis.com | Joined: 06-08-2016 02:15:09

6.6K Tweets

8.8K Followers

1.1K Following

Josh (@josh_bickett)

One area I’m especially interested to see for GPT‑5 is OpenAI’s pull‑request (PR) evaluation.

OpenAI uses this benchmark to assess how close its models are to automating the job of an OpenAI research engineer.

From the o3 and o4‑mini System Card:

"Measuring if and when models

SemiAnalysis (@semianalysis_)

GPT-4.5 is internally called Orion and was supposed to be named GPT-5, but unfortunately, it was not useful at all besides making greentext memes

will brown (@willccbb)

imagine if gas stations didn't tell you how many gallons you were getting because car mileage was a trade secret and the gas station owned the car companies and you could either buy way overpriced gas per-mile or a monthly "max gas subscription" that turns off randomly sometimes