Jonathan Clark (@jonclarkseattle) Twitter Tweets • TwiCopy

Elizabeth Clark

3 years ago

We are excited to release Seahorse 🌊🐴, a ✨multilingual, multifaceted summarization evaluation dataset✨ 96,000+ human ratings to enable faster progress in training and evaluating learnt metrics for summarization! Preprint: arxiv.org/abs/2305.13194 Data: goo.gle/seahorse

iseeaswell꩜bʂky

@iseeaswell

2 years ago

Have you ever wanted a LangID model that works on 1500+ languages? check out FUN-LangID: github.com/google-researc… !

Sasha Rush

@srush_nlp

2 years ago

Introducing COLM (colmweb.org) the Conference on Language Modeling. A new research venue dedicated to the theory, practice, and applications of language models. Submissions: March 15 (it's pronounced "collum" 🕊️)

Dipanjan Das

@dipanjand

2 years ago

Excited to announce the First Conference on Language Modeling, to be held in approximately a year from now. Please let us know if you are interested or have any feedback on the conference: colmweb.org/survey.html

Elizabeth Clark

@eaclark07

2 years ago

Seahorse (a multilingual summarization evaluation dataset with 96K+ human ratings) has been accepted to #EMNLP2023! See you there 🌊🐴 Bonus: The Seahorse metrics are now up on Hugging Face 🤗 hf.co/collections/go…

Benjamin Muller

@ben_mlr

2 years ago

Excited to be presenting our work on **Evaluating and Modeling Attribution for Cross-Lingual Question Answering** at #EMNLP2023 in Singapore. Updated Paper: arxiv.org/abs/2305.14332 We're also releasing the XOR-AttriQA dataset: github.com/google-researc… 🧵

John Wieting

@johnwieting2

2 years ago

Our work on cross-lingual and multilingual attribution will be presented at #EMNLP2023 in Singapore! We have also released our dataset of ~10k 3-way annotations over 5 typologically diverse languages. Arxiv: arxiv.org/abs/2305.14332 Dataset: github.com/google-researc…

Dan Deutsch

@_danieldeutsch

2 years ago

Excited to receive an Outstanding Paper award for this work at EMNLP 2023! Thanks to my co-authors George Foster and Markus Freitag! Updated version available here: aclanthology.org/2023.emnlp-mai…

John Wieting

@johnwieting2

2 years ago

Today at #NeurIPS2023, if you want to learn more about: 1. Robustness of detectors and watermarks to paraphrase attacks (spoiler alert: needs improvement). 2. An alternative detection approach using simple retrieval methods. and ...

Graham Neubig

@gneubig

2 years ago

ACL has removed the anonymity period. This means that ACL submissions can be posted and discussed online at any time, although extensive PR is discouraged. aclweb.org/adminwiki/imag…

Markus Freitag

@markuseful

2 years ago

New paper alert! Designing reliable human evaluation is both crucial and difficult. Human raters can exhibit different behaviors when rating NLG outputs. These differences are not generally due to a rater performing the task incorrectly, but rather due to differences in

Jing Yu Koh

@kohjingyu

2 years ago

Absolutely unhinged. When Jason Baldridge started this in 2021 he would enthusiastically show us weird new images that he took. I thought it was just some weird phase that would fizzle out, but I'm very happy to be wrong, and that it resulted in such a high quality dataset!

Shruti Rijhwani

@shrutirij

2 years ago

I will be at NAACL in-person, and co-organizing the AmericasNLP workshop there! #NLProc #NAACL2024

iseeaswell꩜bʂky

@iseeaswell

a year ago

Excited to announce that 110 languages got added to Google Translate today! Time for context on these languages, especially the communities who helped a lot over the past few years, including Cantonese, NKo, and Faroese volunteers. Also, a 110-language youtube playlist. 🧵

Jeff Dean

@jeffdean

a year ago

As part of Google's 1,000 Languages Initiative, a commitment to support the 1,000 most spoken languages, & w/help of our PaLM 2 LLM, we're adding support for 110 new languages (spoken by 614M people) to Google Translate (now supporting 243 languages). 🎉 blog.google/products/trans…

Google Canada

@googlecanada

a year ago

Exciting news! As of today, the Inuit language of Inuktut will be available on Google Translate - marking the first Canadian Indigenous language on the platform. Tunngasugit! | ᑐᙵᓱᒋᑦ (Welcome!) 🎉 Huge thanks to ITK for their invaluable guidance and

Jason Riesa

@jasonriesa

a year ago

🚀 Join the Gemini Multilinguality team at Google DeepMind 🌐 We’re looking for researchers passionate about making LLMs helpful for all. Dramatically improve model quality, coverage, and cultural relevance across hundreds of languages. #NLProc #MultilingualAI #i18n #LLMs

Markus Freitag

@markuseful

a year ago

Catch our Google Translate Research team at #EMNLP #WMT24! The team will present 9 papers on step-by-step decoding, mitigating metric bias within MBR decoding (+ MBR dataset release), improved human data collection and automatic metrics (MetricX: winner of WMT Metrics Task).
