Peter Jansen ( @peterjansen-ai.bsky.social ) (@peterjansen_ai) 's Twitter Profile
Peter Jansen ( @peterjansen-ai.bsky.social )

@peterjansen_ai

Associate Professor @uarizona; Visiting Scientist @allen_ai, AI/NLP; DiscoveryWorld; EntailmentBank; ScienceWorld; textgames.org list. Tweets/opinions my own

ID: 974390207867858944

linkhttp://cognitiveai.org calendar_today15-03-2018 21:01:43

5,5K Tweet

1,1K Takipçi

654 Takip Edilen

Peter Jansen ( @peterjansen-ai.bsky.social ) (@peterjansen_ai) 's Twitter Profile Photo

Can language models perform end-to-end scientific discovery? In our NeurIPS Spotlight paper, we show: very rarely. Our best model found <20% of discoveries, our best PhDs found nearly all. Paper: arxiv.org/pdf/2406.06769 Code/Web: allenai.github.io/discoveryworld Ai2 Microsoft Research

Can language models perform end-to-end scientific discovery? In our NeurIPS Spotlight paper, we show: very rarely.

Our best model found &lt;20% of discoveries, our best PhDs found nearly all.

Paper: arxiv.org/pdf/2406.06769
Code/Web: allenai.github.io/discoveryworld
<a href="/allen_ai/">Ai2</a> <a href="/MSFTResearch/">Microsoft Research</a>