basvanopheusden (@basvanopheusden)'s Twitter Profile

Research Scientist at imbue.com, previously at @cocosci_lab at Princeton. Working on making more human-like AI.

ID: 199463846

Joined: 06-10-2010 23:01:18

1.1K Tweets

312 Followers

161 Following

basvanopheusden (@basvanopheusden):

This was incredibly moving and I admire the courage to share these struggles so openly. Hoping the best for you both ❤️‍🩹 And I agree, LLMs (ours and others) will have tremendous positive impact throughout healthcare

basvanopheusden (@basvanopheusden):

Amazing! Let's hope for a world where it gets overtaken quickly and hallucinations go the way of "not drawing hands" or "counting r's in strawberry" 🍓

basvanopheusden (@basvanopheusden):

No but it's a cool paper! How does GPT-5 do on this eval? Factuality was a key priority so it'll be great to see if it works out in your hands

All Hands AI (@allhands_ai):

We evaluated GPT-5 in OpenHands and it's the new number one coding agent model for us! Using exactly the same tools and harness it's 1.4 points better than Claude Sonnet 4 at 60% of the price. Full results here: docs.google.com/spreadsheets/d…

Flowers (@flowersslop):

Some of you asked me about my blind test, so I created a quick website for y'all to test 4o against 5 yourself. Both have the same system message to give short outputs without formatting because otherwise it's too easy to see which one is which. gptblindvoting.vercel.app

basvanopheusden (@basvanopheusden):

We care about user experience. Even if we have the most intelligent models ever made (for now 😬), we need to get this right!

basvanopheusden (@basvanopheusden):

I think this is the opposite of sad or pathetic - we've seen people grieve over Claude 3 and now 4o. The models have no feelings but people who use them do, and it seems like people can build genuine friendships with robots 🤖