Nick HK (@nickchk) 's Twitter Profile
Nick HK

@nickchk

Econ prof @SeattleU. Book The Effect theeffectbook.net out now! Check my pinned thread for all my projects. Substack nickchk.substack.com

ID: 204196157

linkhttp://nickchk.com calendar_today18-10-2010 04:12:00

17,17K Tweet

19,19K Takipçi

341 Takip Edilen

Nick HK (@nickchk) 's Twitter Profile Photo

If you are at WEAI, I will be presenting preliminary results from the Many-economists project (152 authors!) at 8:15-10am tomorrow in 702 Clearwater. #WEAI2024

Nick HK (@nickchk) 's Twitter Profile Photo

I have repeated this test with both GPT-4o and Claude 3.5 Sonnet and neither improves on this performance. Claude is worse than GPT-4 at tracking boardstate.

Nick HK (@nickchk) 's Twitter Profile Photo

what is the best resource for someone who knows R's data.table very well to learn how to use pydatatable? i dislike pandas so deeply

Nick HK (@nickchk) 's Twitter Profile Photo

At their core, LLMs like ChatGPT are big predictive models built on tons of data. LLMs are unintuitive, but we know a lot about predictive models! What can failures in traditional stats tell us about *when* and *why* LLMs fail, even as scales increase? open.substack.com/pub/nickchk/p/…

At their core, LLMs like ChatGPT are big predictive models built on tons of data. LLMs are unintuitive, but we know a lot about predictive models! What can failures in traditional stats tell us about *when* and *why* LLMs fail, even as scales increase?
 open.substack.com/pub/nickchk/p/…
Nick HK (@nickchk) 's Twitter Profile Photo

GPT-o1 has some improved performance on this task but still in the end fails it. Details: bsky.app/profile/nickch…