maggie (@ebervector)'s Twitter Profile
maggie

@ebervector

UT Austin Brain Behavior & Computation Lab, Disgraced Aggie

ID: 1654322889296838657

Link: http://mvonebers.com
Date: 05-05-2023 03:11:32

178 Tweets

392 Followers

694 Following

Has anyone been running more world-model metrics on o1/o3 since release? (Does ARC count?) This paper popped into my head again recently, and since they have a really nice open-source project, I ran their logic puzzle metric on o1-mini.