Clara Isabel Meister (@clara__meister)'s Twitter Profile
Clara Isabel Meister

@clara__meister

PhD student in the ML Institute at ETH Zurich.

Still figuring out how Twitter works... 🤦‍♀️

ID: 1141006043218108419

Link: http://cimeister.github.io | Joined: 18-06-2019 15:33:34

102 Tweets

1.1K Followers

49 Following

JHU CLSP (@jhuclsp)

"A Measure-Theoretic Characterization of Tight Language Models" Draft: arxiv.org/abs/2212.10502 By Leo Du (JHU) Lucas Torroba-Hennigen Tiago Pimentel Clara Isabel Meister Jason Eisner (JHU) @ryandcotterell TLDR; Formalizes LMs' distribution (i.e., whether generative process terminates with prob 1.)

JHU CLSP (@jhuclsp)

"Tokenization and the Noiseless Channel" Draft: coming soon! By Vilém Zouhar Clara Isabel Meister Gianni Gastaldi @giannig.bsky.social Leo Du (JHU) Mrinmaya Sachan @ryandcotterell TLDR; Develops information-theoretic efficiency measures of subword tokenization alg + theoretical bounds for such measures.

Thomas Hikaru Clark (@thomashikaru)

Has a language’s word order been affected by a pressure for uniform information density? We investigate this Q in our upcoming TACL paper 🧵 with Clara Isabel Meister, Tiago Pimentel, Michael Hahn, @ryandcotterell, Richard Futrell, Roger Levy arxiv.org/abs/2306.03734

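For readers new to UID, a toy operationalization of mine (not the paper's measure): a common way to score uniformity is the variance of per-word surprisals; lower variance means information is spread more evenly across the sentence.

```python
# Toy UID score (my operationalization, not the paper's): given per-word
# surprisals -log2 p(w_i | w_<i), a sentence is more "uniform" when the
# surprisals have lower variance around their mean.

def uid_variance_score(surprisals: list[float]) -> float:
    """Negative variance of surprisals: higher = more uniform."""
    mean = sum(surprisals) / len(surprisals)
    return -sum((s - mean) ** 2 for s in surprisals) / len(surprisals)

# Same total information (12 bits), distributed differently:
print(uid_variance_score([3.0, 3.0, 3.0, 3.0]))  # -0.0  (perfectly uniform)
print(uid_variance_score([9.0, 1.0, 1.0, 1.0]))  # -12.0 (very peaked)
```
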
Vilém Zouhar (@zouharvi)

I'm elated to present our two latest projects on tokenization. 🧩🧩 The first formalizes Byte-Pair Encoding and finds a nice bound on its greediness. arxiv.org/abs/2306.16837 youtube.com/watch?v=aB7oaS…

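For reference, a minimal sketch of the BPE training loop being formalized (my generic version of the standard algorithm, not the paper's code): repeatedly merge the most frequent adjacent pair of symbols.

```python
# Minimal BPE trainer (generic sketch of the standard algorithm): greedily
# merge the most frequent adjacent symbol pair, for a fixed merge budget.
from collections import Counter

def bpe_merges(text: str, num_merges: int) -> list[tuple[str, str]]:
    seq = list(text)  # start from characters
    merges = []
    for _ in range(num_merges):
        pairs = Counter(zip(seq, seq[1:]))
        if not pairs:
            break
        (a, b), _count = pairs.most_common(1)[0]
        merges.append((a, b))
        # Apply the chosen merge left to right.
        out, i = [], 0
        while i < len(seq):
            if i + 1 < len(seq) and seq[i] == a and seq[i + 1] == b:
                out.append(a + b)
                i += 2
            else:
                out.append(seq[i])
                i += 1
        seq = out
    return merges

print(bpe_merges("hello hello hello", 3))
```
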
Kyle Mahowald (@kmahowald)

Computational psycholinguists and friends take note: Marten van Schijndel and I are co-editing, under the aegis of @AdrianBStaub, a special issue of JML on language models and psycholinguistics! Call: sciencedirect.com/journal/journa…. I'm happy to chat about this in Toronto at #ACL2023NLP!

Clara Isabel Meister (@clara__meister)

Come to our ACL tutorial tomorrow at 14h on generating text from language models! Material will be online here: rycolab.io/classes/acl-20… w/ Tiago Pimentel, Afra Amini, John Hewitt, Luca Malagutti, @ryandcotterell

Ethan Gotlieb Wilcox (@wegotlieb)

🚨🚨New Paper Announcement (to appear in TACL) 📜 from me, Tiago Pimentel, Clara Isabel Meister, @ryandcotterell and Roger Levy: Testing the Predictions of Surprisal Theory in 11 Languages arxiv.org/abs/2307.03667 🌎
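
Surprisal theory's core prediction is that processing cost (e.g., reading time) grows roughly linearly with a word's surprisal, -log p(word | context). A minimal sanity check of that prediction, with entirely hypothetical numbers (a sketch of mine, not the paper's analysis code):

```python
# Sketch: regress reading times on surprisals, the basic linear relation
# surprisal theory predicts. All numbers below are hypothetical.
import numpy as np

surprisal = np.array([2.1, 7.8, 3.3, 12.5, 5.0])          # bits
reading_time = np.array([210., 310., 240., 405., 265.])   # ms

# OLS fit: reading_time ≈ slope * surprisal + intercept
X = np.stack([surprisal, np.ones_like(surprisal)], axis=1)
slope, intercept = np.linalg.lstsq(X, reading_time, rcond=None)[0]
print(f"{slope:.1f} ms/bit, intercept {intercept:.0f} ms")
```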

Clara Isabel Meister (@clara__meister)

If you're at #ACL2023NLP, stop by the poster session tomorrow @ 16h for our paper "On the Efficacy of Sampling Adapters"! Hope to see some of you there :) arxiv.org/pdf/2307.03749… w/ Tiago Pimentel, Luca Malagutti, Ethan Gotlieb Wilcox, @ryandcotterell
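
A sampling adapter reshapes the LM's next-token distribution before sampling. Here is a minimal sketch of one standard adapter, nucleus (top-p) truncation (my version, not the paper's code):

```python
# Nucleus (top-p) truncation as a sampling adapter: keep the smallest set
# of tokens whose cumulative probability reaches p, then renormalize.
import numpy as np

def nucleus_adapter(probs: np.ndarray, p: float = 0.9) -> np.ndarray:
    order = np.argsort(probs)[::-1]                   # most probable first
    cumulative = np.cumsum(probs[order])
    cutoff = int(np.searchsorted(cumulative, p)) + 1  # smallest nucleus
    adapted = np.zeros_like(probs)
    adapted[order[:cutoff]] = probs[order[:cutoff]]
    return adapted / adapted.sum()                    # renormalize

print(nucleus_adapter(np.array([0.5, 0.3, 0.1, 0.07, 0.03])))
```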

ZurichAI (@zurichnlp)

We have an absolutely stellar meetup coming up this October 19th @ 6:00 PM! Jonas Pfeiffer from Google DeepMind will be presenting, followed by Eiso Kant, co-founder and CTO of poolside. Last meetup we ran out of spots! RSVP: zurich-nlp.ch/event/zurich-n…

Tiago Pimentel (@tpimentelms)

Are you interested in word lengths and natural language’s efficiency? If yes, our new #EMNLP2023 paper has everything you need: drama, suspense, a new derivation of Zipf’s law, an update to Piantadosi et al.’s classic word length paper, transformers... 🧵 arxiv.org/abs/2312.03897

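The Piantadosi et al. result being updated is that a word's length tracks its average surprisal in context, not just its frequency. A toy check of that correlation (a sketch of mine, with hypothetical numbers):

```python
# Sketch: correlate word length with mean in-context surprisal.
# Words and surprisal values below are hypothetical.
import numpy as np

words = ["the", "of", "probability", "notwithstanding"]
mean_surprisal = np.array([1.2, 1.5, 9.8, 14.1])        # bits
lengths = np.array([len(w) for w in words], dtype=float)

r = np.corrcoef(lengths, mean_surprisal)[0, 1]
print(f"Pearson r(length, mean surprisal) = {r:.2f}")
```
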
Ethan Gotlieb Wilcox (@wegotlieb)

Thank you to #EMNLP2023 chairs for the 😱 two 😱 outstanding paper awards! I am so grateful to have worked on these projects with wonderful colleagues — Tiago Pimentel (who is the first author on one of the papers!), Clara Isabel Meister, Kyle Mahowald and @ryandcotterell

ZurichAI (@zurichnlp)

New year, new meetup! Join us on January 16th 18:00 - 20:00 for talks from: ➡ Martina Forster and Luca Campanella from Typewise ➡ Tiago Pimentel from ETH Zurich We filled 50/150 RSVPs yesterday, spots are filling up fast! 🚀 zurich-nlp.ch/event/zurich-n…

Ethan Gotlieb Wilcox (@wegotlieb)

🔔🌟 New Preprint Alert 🔔🌟 “An Information-Theoretic Analysis of Targeted Regressions during Reading” with Tiago Pimentel, Clara Isabel Meister, @ryandcotterell - Psycholinguistics 🧠 Computational Modeling 🤖 Crosslinguistic Studies 🌍 Information Theory 📡 osf.io/preprints/psya…

Pietro Lesci (@pietro_lesci)

Happy to share our #ACL2024 paper: "Causal Estimation of Memorisation Profiles" 🎉 Drawing from econometrics, we propose a principled and efficient method to estimate memorisation using only observational data! See 🧵 +Clara Isabel Meister, Thomas Hofmann, Andreas Vlachos, Tiago Pimentel

Tiago Pimentel (@tpimentelms)

Do you want to quantify your model’s counterfactual memorisation using only observational data? Our #ACL2024NLP paper proposes an efficient method to do it :) No interventions required! You can also see how memorisation evolves across training! Check out Pietro's🧵for details :)
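
Very roughly, and heavily hedged: this is my toy rendering of the difference-in-differences idea the paper draws from econometrics, not the authors' estimator, and all numbers are hypothetical. The intuition is to compare how the log-likelihood of instances changes across the training step where they are seen against instances not seen in that window, which controls for general training progress.

```python
# Toy difference-in-differences sketch (mine; hypothetical data):
# memorisation of instances first seen at step t = their log-likelihood
# change across t, minus the same change for unseen control instances.
import numpy as np

# log-likelihoods at checkpoints before/after step t (hypothetical)
treated_before, treated_after = np.array([-8.0, -7.5]), np.array([-5.0, -4.9])
control_before, control_after = np.array([-8.2, -7.8]), np.array([-7.7, -7.2])

memorisation = ((treated_after - treated_before).mean()
                - (control_after - control_before).mean())
print(f"estimated memorisation effect: {memorisation:.2f} nats")
```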

Tiago Pimentel (@tpimentelms)

Hey #NLProc and #psycholing Twitter :) We found a bug in how we're all computing contextual word probabilities and wrote a paper about it! It's a very easy fix, so please check it out! +Clara Isabel Meister

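For context, here is the naive computation the tweet alludes to (my sketch; the paper's precise correction is in the paper itself): a word's log-probability is usually taken as the chain-rule sum of its subword tokens' log-probabilities, and the reported bug concerns how word boundaries (e.g., leading-whitespace tokens) are handled in that sum.

```python
# Naive word probability from subword tokens (the computation the paper
# reportedly corrects; sketch and numbers are mine and hypothetical):
# chain rule gives log p(word) = sum_i log p(token_i | context, tokens_<i).
# With vocabularies that mark word boundaries via leading whitespace,
# this naive sum can misplace the boundary between adjacent words.

def naive_word_logprob(subword_logprobs: list[float]) -> float:
    return sum(subword_logprobs)

# "hello" tokenized as [" he", "llo"], hypothetical log-probs:
print(naive_word_logprob([-3.2, -1.1]))  # -4.3
```
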
Pietro Lesci (@pietro_lesci)

Super excited and grateful that our paper received the best paper award at #ACL2024 🎉 Huge thanks to my fantastic co-authors — Clara Isabel Meister, Thomas Hofmann, Andreas Vlachos, and Tiago Pimentel — the reviewers who recommended our paper, and the award committee #ACL2024NLP

Tiago Pimentel (@tpimentelms)

A string may get 17 times less probability if tokenised as two symbols (e.g., ⟨he, llo⟩) than as one (e.g., ⟨hello⟩)—by an LM trained from scratch in each situation! Our #acl2025nlp paper proposes an observational method to estimate this causal effect! Longer thread soon!

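To see why tokenization alone can move probability so much, a toy calculation (mine, with hypothetical numbers chosen only to match the 17x order of magnitude): the chain rule multiplies one probability factor per token, so a two-token rendering of a string pays two factors.

```python
# Toy arithmetic (hypothetical numbers): the same string gets different
# probability under different tokenizations, since the chain rule
# multiplies one factor per token.
p_one_token = 0.02            # p(<hello>) under the one-token LM
p_two_tokens = 0.015 * 0.08   # p(<he>) * p(<llo> | <he>) under the other LM

print(p_one_token / p_two_tokens)  # ~16.7, i.e. roughly the 17x from the tweet
```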