Wout Schellaert (@woutschellaert)'s Twitter Profile
Wout Schellaert

@woutschellaert

PhD student at UP Valencia working on AI evaluation. Modelling evaluation as prediction. Also other things.

ID: 834187215441973248

Link: https://github.com/wschella

Joined: 21-02-2017 23:45:05

33 Tweets

129 Followers

368 Following

LinkedDataFragments (@ldfragments)

Ever wanted to query over different types of Linked Data interfaces 🤔? You can now do this with Comunica 📬. Try it out now in your browser 🎉! bit.ly/2kFSHOD Announcement: bit.ly/2kFs1xy

EFF (@eff)

BREAKING: In a huge victory, the European Parliament has voted 318-278 against #Article13 and #Article11—the disastrous #CensorshipMachine and #LinkTax copyright proposals. That means we’re close to stopping these terrible proposals—and we’re gaining momentum.

Wout Schellaert (@woutschellaert)

Still 10 full days to submit your papers for the 📐Evaluation Beyond Metrics workshop at @IJCAIconf. Not that you need another excuse to come, with Adina Williams and Prof. Amanda Seed giving a talk! 🔗 sites.google.com/view/ebem2022

Wout Schellaert (@woutschellaert)

We're starting an old school mailing list for folks interested in how to evaluate AI (and all questions that come with it). Open for all to join and post! Come come! groups.google.com/g/ai-eval

Ryan Burnell (@drryanburnell)

Interested in AI robustness and predictability? Come join us in sunny Valencia for an exciting workshop on March 8th! Information here: predictable-ai.org

Lexin Zhou (@lexin_zhou)

1/ New paper in Nature! Discrepancy between human expectations of task difficulty and LLM errors harms reliability. In 2022, Ilya Sutskever (@ilyasut) predicted: "perhaps over time that discrepancy will diminish" (youtu.be/W-F7chPE9nU, min 61-64). We show this is *not* the case!

Wout Schellaert (@woutschellaert)

This is one of the best (if not the best) approaches to AI evaluation I've seen. You can't blabla your way to the predictive power they report in section 3.4!

Felix Reda (@senficon)

Have you ever used GitHub? At your company, your university, your NGO or at home? Then you should be worried about EU #Copyright reform: blog.github.com/2018-03-14-eu-… #CensorshipMachines #FixCopyright

Felix Reda (@senficon)

Free software hosted on @GitHub and elsewhere is threatened by a planned EU law meant to regulate big business infighting. Will the #opensource community discover its political clout in time to stop #UploadFilters? Here’s how you can help: juliareda.eu/2018/04/free-s…