Jesse Vig (@jesse_vig) Twitter Tweets • TwiCopy

good girl

@goodgirlxsz

5 hours ago

🔥Telegram İfşa

thumb_up_off_alt34

chat_bubble_outline39

repeat6

shareShare

Ian Tenney (@[email protected])

@iftenney

3 years ago

Excited to announce v0.5 of the Google AI Learning Interpretability Tool (🔥LIT), an interactive platform to debug, validate, and understand ML model behavior. v0.5 includes exciting features and a new name! pair-code.github.io/lit/ #NLProc #googlePAIR (1/7)

Excited to announce v0.5 of the <a href="/GoogleAI/">Google AI</a> Learning Interpretability Tool (🔥LIT), an interactive platform to debug, validate, and understand ML model behavior. v0.5 includes exciting features and a new name! pair-code.github.io/lit/ #NLProc #googlePAIR (1/7)

thumb_up_off_alt84

chat_bubble_outline2

repeat22

shareShare

Yonatan Belinkov

@boknilev

3 years ago

David Bau giving keynote 3 on Direct Model Editing and Mechanistic Interpretability

<a href="/davidbau/">David Bau</a> giving keynote 3 on Direct Model Editing and Mechanistic Interpretability

thumb_up_off_alt8

chat_bubble_outline2

repeat2

shareShare

Yonatan Belinkov

@boknilev

3 years ago

People have been asking for slides of our ACL 2020 tutorial w/ Sebastian Gehrmann Ellie Pavlick Brown NLP on Interpretability and analysis of #nlproc. Thanks to ACL Anthology team it’s now here: aclanthology.org/2020.acl-tutor… Hopefully still useful though much has changed in the field since.

thumb_up_off_alt47

chat_bubble_outline1

repeat7

shareShare

Alex Fabbri

@alexfabbri4

3 years ago

🚨🆕📄🚨 How gold is your human evaluation? We seek the answer, and its implications in the GPT3 era, in our preprint “Revisiting the Gold Standard: Grounding Summarization Evaluation with Robust Human Evaluation” Paper: arxiv.org/abs/2212.07981 Equal contribution Yixin Liu

thumb_up_off_alt96

chat_bubble_outline5

repeat21

shareShare

Alex Fabbri

@alexfabbri4

3 years ago

You can explore the ACU annotations in Rose🌹along with protocol results on our demo page and start using our dataset! Repo: github.com/Yale-LILY/ROSE Demo page: yale-lily.github.io/ROSE/ Dataset: huggingface.co/datasets/Sales…

thumb_up_off_alt7

chat_bubble_outline1

repeat2

shareShare

Chien-Sheng (Jason) Wu

@jasonwu0731

3 years ago

Human preference != Job Done. Check our interesting findings (ROSE🌹) on summarization! Thanks to my fantastic collaborators Alex Fabbri Yixin Liu Pengfei Liu Yilun Zhao Linyong Nan Ruilin Han Sophia Simeng Han Shafiq Joty Caiming Xiong Dragomir Radev!

thumb_up_off_alt17

chat_bubble_outline0

repeat3

shareShare

Wojciech Kryściński

@iam_wkr

3 years ago

Very excited to have the opportunity to present research done at Salesforce AI Research on Automatic Text Summarization at Zespół Inżynierii Lingwistycznej IPI PAN „Long Story Short: A Talk about Text Summarization” will cover the current state of the field, existing challenges, and future directions.

thumb_up_off_alt22

chat_bubble_outline1

repeat2

shareShare

Jesse Vig

@jesse_vig

3 years ago

How can NLP help us understand the diversity of news coverage of a topic? Check out the latest work from Philippe Laban et al. appearing at #CHI2023 this week.

thumb_up_off_alt3

chat_bubble_outline0

repeat0

shareShare

Yixin Liu

@yixinliu17

2 years ago

Delighted to announce our paper has been accepted for an oral presentation at #ACL2023 oral! In this work we emphasize the intricate complexity of human evaluation while it is becoming even more crucial for both model training and evaluation in the LLM era.

thumb_up_off_alt40

chat_bubble_outline1

repeat8

shareShare

Caiming Xiong

@caimingxiong

2 years ago

Finding a document too dense to decipher? 🤔Content a bit convoluted? Essay too esoteric? Check how we simplify and improve document readability using SWiPE. Join us in making knowledge accessible to all! 🌐 🔗Paper: arxiv.org/abs/2305.19204 🔗Github: github.com/salesforce/sim…

thumb_up_off_alt49

chat_bubble_outline1

repeat14

shareShare

Caiming Xiong

@caimingxiong

2 years ago

By aligning Wikipedia articles to their simplified versions on Simple Wikipedia, we reconstruct the process by which human editors simplify whole documents, in contrast to prior work focused on sentence-level simplification.

thumb_up_off_alt6

chat_bubble_outline1

repeat1

shareShare

Wojciech Kryściński

@iam_wkr

2 years ago

Check out our work on text simplification! SWiPE accepted at #ACL2023

thumb_up_off_alt14

chat_bubble_outline0

repeat3

shareShare

WikiResearch

@wikiresearch

2 years ago

"SWIPE: A Dataset for Document-Level Simplification of Wikipedia Pages" leveraging the entire revision history when pairing enwiki/simplewiki pages, to identify simplification edits. (Laban et al, 2023) arxiv.org/pdf/2305.19204… Wojciech Kryściński

thumb_up_off_alt32

chat_bubble_outline0

repeat9

shareShare

Jesse Vig

@jesse_vig

2 years ago

Check out our latest work on text simplification at #acl2023nlp .

thumb_up_off_alt4

chat_bubble_outline0

repeat0

shareShare

Caiming Xiong

@caimingxiong

2 years ago

🤔Which words in your prompt are most helpful to language models? In our #ACL2023NLP paper, we explore which parts of task instructions are most important for model performance. 🔗 arxiv.org/abs/2306.01150 Code: github.com/fanyin3639/Ret…

thumb_up_off_alt200

chat_bubble_outline3

repeat42

shareShare

Caiming Xiong

@caimingxiong

2 years ago

Excited to share a new preprint on the 🩴FlipFlop Effect. We prompt LLMs with a classification task, and challenge the model by following up with “Are you sure?”. The model can confirm or flip its answer. The results? More flips than a gymnastics competition! 🤸‍♂️ 1/N

thumb_up_off_alt141

chat_bubble_outline4

repeat32

shareShare

Philippe Laban

@philippelaban

2 years ago

Excited to share this fun new work on the 🩴FlipFlop Effect. In short: if you ask models if they're sure of their answers, they tend to change their minds (and severely degrade accuracy). What's mindblowing is how universal the effect is across LLMs (GPTs, Gemini, Claudes, …).

thumb_up_off_alt34

chat_bubble_outline3

repeat7

shareShare

Jesse Vig

@jesse_vig

a year ago

Congratulations David Wan for this great collaboration between Salesforce Research and UNC! Shafiq Joty Mohit Bansal

thumb_up_off_alt29

chat_bubble_outline1

repeat5

shareShare