Jay DeYoung (@jaydepun)'s Twitter Profile
Jay DeYoung

@jaydepun

YI @ AI2

I am sure that my employer does not endorse anything I say.

ID: 1125865846851690497

Joined: 07-05-2019 20:51:50

13 Tweets

98 Followers

173 Following

Nazneen Rajani (@nazneenrajani):

#NLProc does not have a standard benchmark for interpretability. I am stoked to announce ERASER: the first-ever effort on unifying and standardizing NLP tasks with the goal of interpretability. eraserbenchmark.com

Jan-Willem van de Meent (@jwvdm):

1/ New work by Alican and Babak (Babak Esmaeili): "Evaluating Combinatorial Generalization in Variational Autoencoders" (arxiv.org/abs/1911.04594). In this paper we ask the question: "To what extent do VAEs generalize to unseen combinations of features?" (thread)

Lucy Lu Wang (@lucyluwang):

Sharing our #ACL2023NLP paper on evaluation for medical multi-document summarization! New human annotated dataset, new metrics, and an in-depth analysis, here: arxiv.org/abs/2305.13693 Joint w/ Yulia, Jay DeYoung, Thinh Truong, BaileyKuehl, ErinBransom, Ai2, byron wallace
