Pietro Lesci (@pietro_lesci)'s Twitter Profile
Pietro Lesci

@pietro_lesci

PhD student @cambridge_uni. Causality & language models. Former @bainandcompany, intern @ecb @amazonscience. Passionate musician, professional debugger.

ID: 1018592521600028679

Link: https://pietrolesci.github.io · Joined: 15-07-2018 20:25:56

725 Tweets

642 Followers

1.1K Following

EleutherAI (@aieleuther)

Like Pythia, but 1234 isn't your favorite random seed? We retrained Pythia 9x using different random seeds to explore how stable analyses of learning dynamics are to randomness. Meet Pietro Lesci and @blancheminerva Fri 15:00-17:30, Hall 3 + Hall 2B #259 x.com/pietro_lesci/s…

Nedjma Ousidhoum نجمة أوسيدهم (@nedjmaou)

Happy to see our paper win ✨the Best Theme Paper Award✨ at #NAACL2025! Working on this project with this team was a lot of fun :-) Huge thanks to Genta Winata, Frederikus Hudi, Patrick Amadeus, @davidanugraha, and Rifki Afina Putri for leading!

Andreas Vlachos (@vlachos_nlp)

The call for papers for the 8th FEVER workshop at #ACL is out: fever.ai/workshop.html The deadline is May 19th! And if you have a paper already reviewed in ARR, you can commit it until June 9th!

Workshop on Large Language Model Memorization (@l2m2_workshop)

📢 ACL 2025 notifications have been sent out, making this the perfect time to finalize your commitment. Don't miss the opportunity to be part of the workshop! 🔗 Commit here: openreview.net/group?id=aclwe… 🗓️ Deadline: May 20, 2025 (AoE) #ACL2025 #NLProc

Tiago Pimentel (@tpimentelms)

If you're finishing your camera-ready for ACL (#acl2025nlp) or ICML (#icml2025) and want to cite co-first authors more fairly, I just made a simple fix to do this! Just add $^*$ to the authors' names in your bibtex, and the citations should change :) github.com/tpimentelms/ac…
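
A minimal sketch of what the tweet describes, i.e. marking co-first authors with $^*$ directly in a bibtex entry (the entry key, author names, and field values below are hypothetical, and the resulting citation rendering depends on the fix in the linked repository):

% Hypothetical entry: Doe and Roe are marked as co-first authors via "$^*$", per the tweet's recipe
@inproceedings{doe2025example,
  author    = {Doe$^*$, Jane and Roe$^*$, Richard and Smith, Alice},
  title     = {A Hypothetical Example Paper},
  booktitle = {Proceedings of ACL},
  year      = {2025}
}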

Tiago Pimentel (@tpimentelms)

A string may get 17 times less probability if tokenised as two symbols (e.g., ⟨he, llo⟩) than as one (e.g., ⟨hello⟩)—by an LM trained from scratch in each situation! Our #acl2025nlp paper proposes an observational method to estimate this causal effect! Longer thread soon!
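
To make the comparison concrete: writing p_sub and p_whole for two LMs trained from scratch with tokenisers that respectively split or keep the string (this notation is illustrative, not the paper's), the two-symbol probability is computed by the chain rule,

p_sub(hello | context) = p_sub(he | context) · p_sub(llo | context, he),

and the tweet's claim is that this product can be roughly 17× smaller than p_whole(⟨hello⟩ | context), where the second model scores the string as a single symbol.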

Tiago Pimentel (@tpimentelms)

If you use LLMs, tokenisation bias probably affects you:
* Text generation: tokenisation bias ⇒ length bias 🤯
* Psycholinguistics: tokenisation bias ⇒ systematically biased surprisal estimates 🫠
* Interpretability: tokenisation bias ⇒ biased logits 🤔