Kaifeng Lyu (@vfleaking) 's Twitter Profile
Kaifeng Lyu

@vfleaking

Incoming Tsinghua AP. Postdoctoral Research Fellow @ Simons Institute. PhD @ Princeton.

ID: 739313785429598213

Link: http://kaifeng.ac · Joined: 05-06-2016 04:31:57

51 Tweets

859 Followers

584 Following

Zeyuan Allen-Zhu, Sc.D. (@zeyuanallenzhu) 's Twitter Profile Photo

Our 12 scaling laws (for LLM knowledge capacity) are out: arxiv.org/abs/2404.05405. Took me 4mos to submit 50,000 jobs; took Meta 1mo for legal review; FAIR sponsored 4,200,000 GPU hrs. Hope this is a new direction to study scaling laws + help practitioners make informed decisions

Xiangyu Qi (@xiangyuqi_pton) 's Twitter Profile Photo

Our recent paper shows:
1. Current LLM safety alignment is only a few tokens deep.
2. Deepening the safety alignment can make it more robust against multiple jailbreak attacks.
3. Protecting initial token positions can make the alignment more robust against fine-tuning attacks.
Sanjeev Arora (@prfsanjeevarora) 's Twitter Profile Photo

1/ LLMs are often used to generate new math questions. But can they generate challenging math questions? Current methods yield questions that are either too easy or too similar to existing ones. Our new paper "AI-Assisted Generation of Difficult Math Questions" shows how to

Kaifeng Lyu (@vfleaking) 's Twitter Profile Photo

💡 The Mathematics of Modern Machine Learning (M3L) workshop is back for its 2nd edition at NeurIPS 2024. Submit your work and share your perspectives on modern ML theory! 📅 Submission ddl: Sept 29, 2024 (2 days after ICLR abstract ddl) 🌐 sites.google.com/view/m3l-2024

Kaifeng Lyu (@vfleaking) 's Twitter Profile Photo

Thanks to everyone who joined and supported the M3L workshop this year! It was so exciting to see so many inspiring ideas and discussions. Unfortunately, I got a fever one day before the workshop and couldn’t attend in person. Looking forward to seeing you all next year!

Xingyu Zhu (@xingyuzhu_) 's Twitter Profile Photo

Kids use open textbooks for homework. Can LLM training benefit from "helpful textbooks" in context, with no gradients computed on these tokens?

We call this Context-Enhanced Learning – it can exponentially accelerate training while avoiding verbatim memorization of "textbooks"!
Rui Lu (@raylu_thu) 's Twitter Profile Photo

🚨Ever wonder why diffusion models generate nonsensical text? Our latest study at #ICLR2025 uncovers "Local Generation Bias"—a hidden training bias causing textual hallucinations!

🧠 Key finding: Diffusion models independently generate symbols locally without global context.
Noam Razin (@noamrazin) 's Twitter Profile Photo

The success of RLHF depends heavily on the quality of the reward model (RM), but how should we measure this quality?

📰 We study what makes a good RM from an optimization perspective. Among other results, we formalize why more accurate RMs are not necessarily better teachers! 🧵
Xiangyu Qi (@xiangyuqi_pton) 's Twitter Profile Photo

We will present this paper at #ICLR2025!
1. Oral Session 1D (Thursday 10:42am): Ashwinee Panda will give a talk.
2. Poster Session 4 (Friday 3pm): Come chat with Ashwinee Panda, Kaifeng Lyu, Xiao Ma, and Ahmad Beirami. Unfortunately, I

Kaifeng Lyu (@vfleaking) 's Twitter Profile Photo

Thrilled to share that our paper “Safety Alignment Should be Made More Than Just a Few Tokens Deep” has received an ICLR 2025 Outstanding Paper Award! This project began as an effort to defend against fine-tuning attacks with constrained supervised fine-tuning (SFT). Along the

Kaifeng Lyu (@vfleaking) 's Twitter Profile Photo

What's the optimal learning rate schedule for LLM pretraining? Come meet us this afternoon! Poster Presentation: 🗓 Friday, April 25 🕒 3:00 PM – 5:30 PM CST 📍 Hall 3 + Hall 2B, Poster #237

Kaifeng Lyu (@vfleaking) 's Twitter Profile Photo

Excited to present our paper this morning at ICLR 2025, revealing the gap in CoT reasoning between RNNs and Transformers! Poster Presentation: 🗓 Saturday, April 26 🕒 10:00 AM – 12:30 PM 📍 Hall 2, Poster #640

Zhiyuan Li (@zhiyuanli_) 's Twitter Profile Photo

Excited to share our new method ✏️PENCIL! It decouples space complexity from time complexity in LLM reasoning by allowing the model to recursively erase and generate thoughts. Joint work with my student Chenxiao Yang, along with Nati Srebro and David McAllester.