Dayeon (Zoey) Ki (@zoeykii) 's Twitter Profile
Dayeon (Zoey) Ki

@zoeykii

CS PhD @umdclip | Multilingual, Cultural #NLProc | MT | ex. Intern @AdobeResearch, @NAVER__Cloud, @LG_AI_Research

ID: 1563681285515554818

Link: https://dayeonki.github.io/ | Joined: 28-08-2022 00:14:14

69 Tweets

179 Followers

274 Following

Vered Shwartz (@veredshwartz) 's Twitter Profile Photo

I'm excited to announce that my nonfiction book, "Lost in Automatic Translation: Navigating Life in English in the Age of Language Technologies", will be published this summer by Cambridge University Press. I can't wait to share it with you! 📖🤖 cambridge.org/core/books/los…

Ricardo Rei (@ricardorei7) 's Twitter Profile Photo

🚀 Tower+: our latest model in the Tower family — sets a new standard for open-weight multilingual models! We show how to go beyond sentence-level translation, striking a balance between translation quality and general multilingual capabilities. 1/5 arxiv.org/pdf/2506.17080

Dayeon (Zoey) Ki (@zoeykii) 's Twitter Profile Photo

Why should you attend this talk? 🤔
A. Nishant put so much effort
B. Learn the real limitations of MCQA
C. Great takeaways for building better benchmarks
D. All of the above ✔️

Niyati Bafna (@bafnaniyati) 's Twitter Profile Photo

📢When LLMs solve tasks with a mid-to-low resource input/target language, their output quality is poor. We know that. But can we pin down what breaks inside the LLM? We introduce the 💥translation barrier hypothesis💥 for failed multilingual generation. arxiv.org/abs/2506.22724

Chau Minh Pham (@chautmpham) 's Twitter Profile Photo

CLIPPER has been accepted to #COLM2025! In this work, we introduce a compression-based pipeline to generate synthetic data for long-context narrative reasoning tasks. Excited to be in Montreal this October🍁

Vilém Zouhar (@zouharvi) 's Twitter Profile Photo

You have a budget to human-evaluate 100 inputs to your models, but your dataset is 10,000 inputs. Do not just pick 100 randomly!🙅 We can do better. "How to Select Datapoints for Efficient Human Evaluation of NLG Models?" shows how.🕵️ (random is still a devilishly good baseline)

LG AI Research (@lg_ai_research) 's Twitter Profile Photo

📣Thrilled to announce the drop of EXAONE 4.0, the next-generation hybrid AI. 🙌Prepare to be amazed by EXAONE’s capabilities. #EXAONE #LG_AI_Research #HybridAI #AI lgresearch.ai/blog/view?seq=…

Alexander Hoyle (@miserlis_) 's Twitter Profile Photo

(Repost due to mistaken deletion😢): Evaluating topic models (& doc clustering methods) is hard. In fact, since our paper critiquing standard eval practices 4 years ago, there hasn't been a good replacement metric. That ends today! Our ACL paper introduces a new evaluation🧵

Vishakh Padmakumar (@vishakh_pk) 's Twitter Profile Photo

Maybe don't use an LLM for _everything_? Last summer, I got to fiddle again with content diversity at Adobe Research, and we showed that agentic pipelines that mix LLM-prompt steps with principled techniques can yield better, more personalized summaries

Gabrielle Kaili-May Liu (@pybeebee) 's Twitter Profile Photo

I will be presenting our work 𝗠𝗗𝗖𝘂𝗿𝗲 at #ACL2025NLP in Vienna this week! 🇦🇹 Come by if you’re interested in multi-doc reasoning and/or scalable creation of high-quality post-training data!
📍 Poster Session 4 @ Hall 4/5
🗓️ Wed, July 30 | 11-12:30
🔗 aclanthology.org/2025.acl-long.…

Dayeon (Zoey) Ki (@zoeykii) 's Twitter Profile Photo

I'm at #ACL2025 presenting our work on enhancing equitable cultural alignment through multi-agent debate ✨ Come visit our oral presentation!
📍Computational Social Science and Cultural Analytics session (Level 1 1.85)
📆Tuesday (7/29) 2-3:30pm
📝aclanthology.org/2025.acl-long.…

Dayeon (Zoey) Ki (@zoeykii) 's Twitter Profile Photo

I'll also be presenting our paper on using question-answer pairs as a new signal for spotting translation errors 🕵️ Come talk more about MT evaluation!
📍Poster session (Hall X4, X5)
📆Tuesday (7/29) 4-5:30pm
📝aclanthology.org/2025.findings-…

neuronpedia (@neuronpedia) 's Twitter Profile Photo

Today, we're releasing The Circuit Analysis Research Landscape: an interpretability post extending & open sourcing Anthropic's circuit tracing work, co-authored by Paul Jankura, Google DeepMind, Goodfire, EleutherAI, and Decode Research. Here's a quick demo, details follow: ⤵️

Belen Alastruey (@b_alastruey) 's Twitter Profile Photo

🚀New paper alert! 🚀 In our work at AI at Meta, we dive into the struggles of mixing languages in largely multilingual Transformer encoders and use the analysis as a tool to better design multilingual models to obtain optimal performance. 📄: arxiv.org/abs/2508.02256 🧵(1/n)

Jay Van Bavel, PhD (@jayvanbavel) 's Twitter Profile Photo

AI shows ingroup bias towards AI content! If we deploy LLMs in decision-making roles (e.g., purchasing goods, selecting academic submissions) they will favor LLM agents over ordinary humans pnas.org/doi/10.1073/pn…

Yuntian Deng (@yuntiandeng) 's Twitter Profile Photo

🚀New dataset release: WildChat-4.8M
4.8M real user-ChatGPT conversations collected from our public chatbots:
- 122K from reasoning models (o1-preview, o1-mini): represent real uses in the wild and very costly to collect
- 2.5M from GPT-4o
🔗 hf.co/datasets/allen… (1/4)

CSCS Lugano (@cscsch) 's Twitter Profile Photo

EPFL, ETH Zurich, and #CSCS today released Apertus, Switzerland's first large-scale, multilingual language model (LLM). As a fully open LLM, it serves as a building block for developers and organizations to create their own applications: cscs.ch/science/comput… #Apertus #AI
