Zhipeng(Jason Z) Wang 🇺🇦 (@pkuwzp) Twitter Tweets • TwiCopy

Zhipeng(Jason Z) Wang 🇺🇦

@pkuwzp

+ Follow

ML/CV Researcher, Statistician, PhD from @RiceUniversity, Sr Manager @LinkedIn. past: @AWS, @Apple, @GoogleAI, @PKU1898 @WashU alumnus, Opinions are my own

ID: 990752318248304642

linkhttps://scholar.google.com/citations?user=OdubVmAAAAAJ&hl=en calendar_today30-04-2018 00:38:54

2,2K Tweet

486 Takipçi

734 Takip Edilen

Zhipeng(Jason Z) Wang 🇺🇦

@pkuwzp

4 months ago

It was so great to talk about how the DeepSpeed team has been driving the frontier of LLM model training and how we will thrive under PyTorch Foundation. Kudos to the DeepSpeed core team Tunji, Masahiro, Minjia, Jeff, Logan, Stas, Guokai and Sam who continue driving the hard

thumb_up_off_alt9

chat_bubble_outline0

repeat0

shareShare

Zhipeng(Jason Z) Wang 🇺🇦

@pkuwzp

3 months ago

Many posts have been discussing how #claudecode and other AI toolings significantly changed the software engineering profession, and how this profession might come to an end. Here is my perspective: AI models will and should outperform human beings on tasks that leveraging

thumb_up_off_alt0

chat_bubble_outline0

repeat0

shareShare

Zhipeng(Jason Z) Wang 🇺🇦

@pkuwzp

3 months ago

Brilliant paper!

thumb_up_off_alt0

chat_bubble_outline0

repeat0

shareShare

Zhipeng(Jason Z) Wang 🇺🇦

@pkuwzp

3 months ago

As 2026 approaches 🚀, it is a fitting moment to reflect on the achievements of 2025. This year has been exceptionally productive, and I am truly grateful for the opportunity to collaborate with outstanding teams and partners on a range of cutting-edge AI research initiatives.

thumb_up_off_alt5

chat_bubble_outline0

repeat1

shareShare

Zhipeng(Jason Z) Wang 🇺🇦

@pkuwzp

3 months ago

Turkey grants Chinese ordinary passport visa-free access starting from Jan 2, 2026. It seems like Turkey can be one of the good places to host top ML conferences (e.g. #ICLR, #NeurIPS and #ICML etc.) given that it grants visa-free access to all US/Canada/European countries as

thumb_up_off_alt3

chat_bubble_outline2

repeat1

shareShare

Zhipeng(Jason Z) Wang 🇺🇦

@pkuwzp

3 months ago

Check out our work on AHA: Audio Hallucination Alignment for Audio Large Language Models. In this work we built the diagnostic benchmark, proposed explicit taxonomy for audio hallucination, and also provide preference alignment datasets to help us beat SoTA performance on Audio

thumb_up_off_alt2

chat_bubble_outline0

repeat0

shareShare

Zhipeng(Jason Z) Wang 🇺🇦

@pkuwzp

3 months ago

One controversial point: We should stop doing any live coding interviews for tech companies. Or at least let's not test candidates any tricky data structures & algorithms questions. At this moment, we are wasting everyone's time doing this, because it's no point to memorize all

thumb_up_off_alt6

chat_bubble_outline2

repeat0

shareShare

Zhipeng(Jason Z) Wang 🇺🇦

@pkuwzp

3 months ago

As a researcher who works on both ML Systems and Multimodal models, 3D modeling and rendering is always a challenging field as it demands substantial resources and manual effort when scene editing is performed in the traditional manner. Despite recent progress in VLM-based

thumb_up_off_alt4

chat_bubble_outline1

repeat1

shareShare

Zhipeng(Jason Z) Wang 🇺🇦

@pkuwzp

2 months ago

Check out our recent work on Efficient Knowledge Distillation of Large Reasoning Models (LRM) (arxiv.org/abs/2512.21002). Knowledge distillation (KD) of LRM involves distilling over lengthy sequences with prompt (P), chain-of-thought (CoT), and answer (A) sections, which is

thumb_up_off_alt90

chat_bubble_outline1

repeat14

shareShare

Zhipeng(Jason Z) Wang 🇺🇦

@pkuwzp

2 months ago

Our paper "Reasoning Models Can be Accurately Pruned Via Chain-of-Thought Reconstruction" has been accepted to ICLR 2026! Kudos to Ryan Lucas, Qingquan (QQ) Song Shao Tang, Rahul Mazumder and Kayhan Behdin, and all the PCs/ACs/SPCs who put tremendous amount of work in it. See you in

Our paper "Reasoning Models Can be Accurately Pruned Via Chain-of-Thought Reconstruction" has been accepted to ICLR 2026! Kudos to Ryan Lucas, <a href="/qingquan_song/">Qingquan (QQ) Song</a> Shao Tang, Rahul Mazumder and Kayhan Behdin, and all the PCs/ACs/SPCs who put tremendous amount of work in it. See you in

thumb_up_off_alt27

chat_bubble_outline2

repeat2

shareShare

Zhipeng(Jason Z) Wang 🇺🇦

@pkuwzp

2 months ago

🚀 Our work on "Scaling up Large Language Models Systems for Semantic Job Search" has been accepted to #MLSys2026 ! This paper highlights our years of science and engineering effort to make LLM serving more efficient, which can accommodate high-throughput, low latency online

thumb_up_off_alt7

chat_bubble_outline0

repeat1

shareShare

Zhipeng(Jason Z) Wang 🇺🇦

@pkuwzp

2 months ago

I personally went through the experimental classes in China and grind all my way to get admission to Peking University. I ranked top 5 in my province and worked super hard. I feel one thing I appreciate Chinese education is that you don’t have too much distractions. If you like

thumb_up_off_alt6

chat_bubble_outline1

repeat0

shareShare

Zhipeng(Jason Z) Wang 🇺🇦

@pkuwzp

2 months ago

Saw Jerry Tworek ‘s post about the necessity of coding interviews. I feel we should probably change the live coding interview to “code reviews” interview. After all it’s going to be what we need to do at jobs. During the interview we just show the candidates some piece of code,

thumb_up_off_alt2

chat_bubble_outline0

repeat0

shareShare

Zhipeng(Jason Z) Wang 🇺🇦

@pkuwzp

2 months ago

It’s crazy, by far the most capable Video Generation model.

thumb_up_off_alt1

chat_bubble_outline0

repeat0

shareShare