Yong Zheng-Xin (Yong) (@yong_zhengxin) 's Twitter Profile
Yong Zheng-Xin (Yong)

@yong_zhengxin

🎯 reasoning and alignment
🌎 making LLMs safe and helpful for everyone
📍 phd @BrownCSDept + research @AIatMeta @Cohere_Labs

ID: 955679485273153537

Link: https://yongzx.github.io/ · Joined: 23-01-2018 05:51:59

553 Tweets

1.1K Followers

1.1K Following

AK (@_akhaliq) 's Twitter Profile Photo

Crosslingual Reasoning through Test-Time Scaling TL;DR: scaling up the thinking tokens of English-centric reasoning language models, such as the s1 models, can improve multilingual math reasoning performance. The paper also analyzes language-mixing patterns, effects of different

Stephen Bach (@stevebach) 's Twitter Profile Photo

Really interesting findings from Yong and many great collaborators. Test-time scaling generalizes cross-lingually, but maybe not in the way you’d hope. S1 tends to quote in the original language and then think in English.

Ruochen Zhang not @ ICLR (@ruochenz_) 's Twitter Profile Photo

When R1 came out, I was thinking we should have a model trained to “reason” not only in English 🤔 Guess what: we show that with only English finetuning, the reasoning generalizes to other languages too! Models can also be “forced” to reason in other langs 🤯 However, more work

MKI (@mki028) 's Twitter Profile Photo

Exploring why such a mechanism occurs, both from the model's inner workings and from the data itself (using data attribution methods), seems intriguing to look into 👀

Yong Zheng-Xin (Yong) (@yong_zhengxin) 's Twitter Profile Photo

It has been such a great experience collaborating with Julia from Cohere Labs! Come check out our new work on how test-time scaling of English-centric models improves crosslingual reasoning 🔥 📜 arxiv.org/abs/2505.05408

Niklas Muennighoff (@muennighoff) 's Twitter Profile Photo

In 2022, with Yong Zheng-Xin (Yong) & team, we showed that models trained to follow instructions in English can follow instructions in other languages. Our new work below shows that models trained to reason in English can also reason in other languages!

Genta Winata (@gentaiscool) 's Twitter Profile Photo

⭐️Reasoning LLMs trained on English data can think in other languages. Read our paper to learn more! Thank you Yong Zheng-Xin (Yong) for leading the project and team! It was an exciting collab! farid Jonibek Mansurov Ruochen Zhang Niklas Muennighoff Carsten Eickhoff Julia Kreutzer

Alham Fikri Aji (@alhamfikri) 's Twitter Profile Photo

🚨Multilingual LLMs, finetuned only on English reasoning data, can still reason when asked non-English questions, showing reasoning traces that go back & forth between languages. I had so much fun working on this project. Please give our paper a read! arxiv.org/abs/2505.05408

farid (@faridlazuarda) 's Twitter Profile Photo

Can English-finetuned LLMs reason in other languages? Short Answer: Yes, thanks to “quote-and-think” + test-time scaling. You can even force them to reason in a target language! But: 🌐 Low-resource langs & non-STEM topics still tough. New paper: arxiv.org/abs/2505.05408

Shan Chen (@shan23chen) 's Twitter Profile Photo

Designing a hard but useful benchmark has always been a passion of mine. Here we present MedBrowseComp, a deep research + computer use benchmark that is easy to verify (like BrowseComp from OpenAI) but still very expandable 💊! Project page: moreirap12.github.io/mbc-browse-app/ 1/n

Brown CS (@browncsdept) 's Twitter Profile Photo

Congratulations to Brown CS faculty members Stephen Bach, Ugur Çetintemel, Ellie Pavlick, and Nikos Vasilakis, who have received Brown University's OVPR Seed Award and Salomon Faculty Research Award honors! Learn more about their work at Brown CS News: cs.brown.edu/news/2025/05/2…
