Wenda Xu (@wendaxu2) 's Twitter Profile
Wenda Xu

@wendaxu2

I work on evaluation of AI-generated text and LLM post-training. Research Scientist @GoogleAI. PhD @UCSB

ID: 1448188793794686979

Link: https://xu1998hz.github.io · Joined: 13-10-2021 07:28:17

304 Tweets

1.1K Followers

366 Following

Jiachen Li (@jiachenli11) 's Twitter Profile Photo

I am actively seeking industrial opportunities and bring expertise in training T2I and T2V diffusion models, along with a strong background in Deep RL. If you think my skills align with your needs, feel free to reach out!

Wenda Xu (@wendaxu2) 's Twitter Profile Photo

Michael is a rising star, a fun labmate, and a great person! Please consider hiring him. He will surprise you with his Mandarin skills 🤓

Xuandong Zhao (@xuandongzhao) 's Twitter Profile Photo

I am deeply sorry and heartbroken over the loss of Felix Hill. His post docs.google.com/document/d/1aE… is a poignant reminder of the mental health challenges we face in the fast-paced and high-pressure AI field. Lately, I’ve also been feeling overwhelmed by the rapid advancements in

Yuchen Jin (@yuchenj_uw) 's Twitter Profile Photo

This "Aha moment" in the DeepSeek-R1 paper is huge: Pure reinforcement learning (RL) enables an LLM to automatically learn to think and reflect. This challenges the prior belief that replicating OpenAI's o1 reasoning models requires extensive CoT data. It turns out you just

Xiao Pu (@xiaosophiapu) 's Twitter Profile Photo

🚀 Excited to share that our work has been accepted to #NAACL2025! We show that LLM watermarks can be removed in a black-box setting 🛠️ For more details: arxiv.org/abs/2411.01222

Lewis Tunstall (@_lewtun) 's Twitter Profile Photo

We are reproducing the full DeepSeek R1 data and training pipeline so everybody can use their recipe. Instead of doing it in secret we can do it together in the open! 🧪 Step 1: replicate the R1-Distill models by distilling a high-quality reasoning corpus from DeepSeek-R1. 🧠

Wenda Xu (@wendaxu2) 's Twitter Profile Photo

Are there any papers that theoretically or quantitatively demonstrate that training a language understanding model, like a metric or reward model, is easier than training a language generation model? Alternatively, should I justify this based on the differences in the output

Lei Li (@lileics) 's Twitter Profile Photo

A freshly minted doctor! Congratulations to Wenda Xu on successfully defending his PhD thesis, "On Evaluation and Efficient Post-training for LLMs". Highly recommend his slides, covering RL training, better knowledge distillation, LLM/text-generation evaluation, and bias in LLM-as-a-judge: docs.google.com/presentation/d…