Zhenwen Liang (@liangzhenwen) 's Twitter Profile
Zhenwen Liang

@liangzhenwen

Research Scientist in NLP, Tencent AI Lab, Seattle. Previously interned at Salesforce AI Research, AI2, and Tencent AI Lab.

ID: 1516449485295132680

Website: https://Zhenwen-NLP.github.io · Joined: 19-04-2022 16:11:58

72 Tweets

972 Followers

273 Following

Wenhao Yu (@wyu_nd):

New paper: VLMs can self-reward during RL training — no visual annotations needed!

-- Decompose VLM reasoning into visual vs. language parts
--  Prompt the same VLM without visual input for visual reward

We call it 𝐕𝐢𝐬𝐢𝐨𝐧-𝐒(𝐞𝐥𝐟)𝐑𝟏: arxiv.org/abs/2508.19652
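The tweet's idea can be sketched as a small reward function: split the model's output into a visual-description part and a language-reasoning part, then re-prompt the *same* model text-only with just the description; if the text-only pass recovers the answer, the visual part earns a reward. This is a minimal toy sketch of that loop, not the paper's implementation — the `[REASONING]` delimiter, the stub `toy_model`, and all function names here are illustrative assumptions.

```python
# Hedged sketch of the self-reward idea described in the tweet.
# All model calls are toy stubs; the real method is in the paper.
import re


def split_reasoning(output: str) -> tuple[str, str]:
    """Split a model output into (visual description, language reasoning).

    The "[REASONING]" delimiter is a made-up convention for this sketch.
    """
    visual, _, language = output.partition("[REASONING]")
    return visual.strip(), language.strip()


def text_only_answer(model, visual_description: str, question: str) -> str:
    """Re-prompt the same model WITHOUT the image, using only the description."""
    return model(f"Description: {visual_description}\nQ: {question}")


def visual_self_reward(model, output: str, question: str, gold: str) -> float:
    """Reward 1.0 if the description alone lets the model recover the answer."""
    visual, _ = split_reasoning(output)
    answer = text_only_answer(model, visual, question)
    return 1.0 if answer.strip() == gold else 0.0


# Toy stand-in for a VLM: "answers" by echoing the first number it sees.
def toy_model(prompt: str) -> str:
    m = re.search(r"\d+", prompt)
    return m.group(0) if m else "unknown"


out = "The image shows 3 red apples on a table. [REASONING] Count them: 3."
print(visual_self_reward(toy_model, out, "How many apples?", "3"))  # prints 1.0
```

The point of the design is that no visual annotations are needed: the gold answer on the text-only pass doubles as supervision for the visual part.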
Wenhao Yu (@wyu_nd):

Code for 𝐏𝐚𝐫𝐚𝐥𝐥𝐞𝐥-𝐑𝟏 is live! 👉 github.com/zhengkid/Paral…
(now 189 stars and climbing 🔥)

It lets LLMs think in parallel — multiple reasoning paths, smarter synthesis, more creative inference!

Miss this paper and you’re missing a leap forward: arxiv.org/abs/2509.07980
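The "parallel thinking" idea above — multiple reasoning paths, then synthesis — can be sketched in a few lines. Here synthesis is plain majority voting, a deliberately simple stand-in; Parallel-R1's actual training and synthesis are more involved, and `noisy_solver` is a toy assumption, not the model.

```python
# Hedged sketch of parallel reasoning + synthesis.
# The solver and the voting-based synthesis are toy stand-ins.
import random
from collections import Counter


def sample_paths(solve, question, n=5, seed=0):
    """Run the solver n times to get independent reasoning paths."""
    rng = random.Random(seed)
    return [solve(question, rng) for _ in range(n)]


def synthesize(answers):
    """Combine parallel paths; here, simple majority vote."""
    return Counter(answers).most_common(1)[0][0]


# Toy solver: right 70% of the time, wrong otherwise.
def noisy_solver(question, rng):
    return "4" if rng.random() < 0.7 else "5"


paths = sample_paths(noisy_solver, "2 + 2 = ?", n=7)
print(synthesize(paths))  # prints 4
```

Even this crude version shows why parallel paths help: a solver that is only 70% reliable per path becomes much more reliable after aggregation.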
Yuexing Hao (@yuexinghao):

Do more GPUs mean “better” foundation model (FM) research? We analyzed 6,517 FM papers and surveyed 229 FM authors to understand the role of computing resources in publishing. Surprisingly….. arxiv.org/abs/2510.13621