Kenny Peng (@kennylpeng)'s Twitter Profile
Kenny Peng

@kennylpeng

CS PhD student at Cornell Tech. Interested in interactions between algorithms and society. Princeton math '22.

ID: 1145703952417218562

Link: http://kennypeng.me · Joined: 01-07-2019 14:41:23

57 Tweets

97 Followers

24 Following

Raj Movva (@rajivmovva)'s Twitter Profile Photo

1. We will present HypotheSAEs at #ICML2025, Wednesday 11am (West Hall B2-B3 #W-421).

2. Let me know if you'd like to chat about:
- AI for hypothesis generation
- why SAEs are still useful
- whether PhD students should stay in school
Neel Nanda (@neelnanda5)'s Twitter Profile Photo

I've resolved this positively: 2 papers convincingly show sparse autoencoders beating baselines on real tasks: Hypothesis Generation & Auditing LLMs

SAEs shine when you don't know what you're looking for, but lack precision. Sometimes the right tool for the job, sometimes not.
Sayash Kapoor (@sayashk)'s Twitter Profile Photo

The mainstream view of AI for science says AI will rapidly accelerate science, and that we're on track to cure cancer, double the human lifespan, colonize space, and achieve a century of progress in the next decade. 

In a new AI Snake Oil essay, Arvind Narayanan and I argue that
Raj Movva (@rajivmovva)'s Twitter Profile Photo

🌟 HypotheSAEs update: open LLMs now supported for the full hypothesis generation pipeline!

Labeling SAE neurons and annotating concepts works very well with Qwen3-8B and larger models ⬇️ (notably, other models didn't work as well).

Brief 🧵
Kenny Peng (@kennylpeng)'s Twitter Profile Photo

One paragraph pitch for why sparse autoencoders are cool: Text embeddings capture tons of information, but individual dimensions are uninterpretable. It would be great if each dimension reflected a concept (“dimension 12 is about cats”). But text embeddings are ~1000 dimensions
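The pitch above can be made concrete with a toy sketch. The snippet below is a minimal, hypothetical illustration of a top-k sparse autoencoder encoder (not the HypotheSAEs implementation): it projects a dense embedding into a much wider feature space and keeps only the k largest activations, so each input is explained by a handful of features that one could then try to label ("feature 12 is about cats"). All names and sizes here are illustrative assumptions.

```python
import numpy as np

rng = np.random.default_rng(0)

def sae_encode(x, W_enc, b_enc, k=8):
    """Top-k SAE encoder sketch: ReLU pre-activations, then keep only
    the k largest per example, zeroing the rest. The surviving features
    form a sparse, potentially interpretable code for the embedding."""
    acts = np.maximum(x @ W_enc + b_enc, 0.0)       # ReLU
    # indices of everything except each row's top-k activations
    drop = np.argsort(acts, axis=-1)[..., :-k]
    z = acts.copy()
    np.put_along_axis(z, drop, 0.0, axis=-1)        # enforce sparsity
    return z

d_model, d_sae, k = 64, 512, 8                      # toy sizes only
W_enc = rng.standard_normal((d_model, d_sae)) / np.sqrt(d_model)
b_enc = np.zeros(d_sae)
x = rng.standard_normal((4, d_model))               # stand-in embeddings

z = sae_encode(x, W_enc, b_enc, k=k)
# each of the 4 embeddings is now described by at most k active features
```

A decoder (a linear map back to `d_model` dimensions) and a reconstruction loss would complete the autoencoder; the sketch shows only the encoding step the tweet's argument turns on.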

Kenny Peng (@kennylpeng)'s Twitter Profile Photo

How do we reconcile excitement about sparse autoencoders with negative results showing that they underperform simple baselines? Our new position paper makes a distinction: SAEs are very useful tools for discovering *unknown* concepts, less good for acting on *known* concepts.
Benjamin Laufer (@bendlaufer)'s Twitter Profile Photo

1/10. In a new paper with Hamidah Oderinwale and Jon Kleinberg, we mapped the family trees of 1.86 million AI models on Hugging Face — the largest open-model ecosystem in the world. AI evolution looks kind of like biology, but with some strange twists. 🧬🤖

Emma Pierson (@2plus2make5)'s Twitter Profile Photo

🚨 New postdoc position in our lab UC Berkeley EECS! 🚨 (please retweet + share with relevant candidates)

We seek applicants with experience in language modeling who are excited about high-impact applications in the health and social sciences!

More info in thread

1/3
Emma Pierson (@2plus2make5)'s Twitter Profile Photo

Many thanks to Open Philanthropy for supporting our work on sparse autoencoders for hypothesis generation (arxiv.org/abs/2502.04382) - in particular, using these techniques to build safer and better-aligned LLMs! openphilanthropy.org/grants/uc-berk…

Kevin Ren (@kevinren09)'s Twitter Profile Photo

Excited to share that our paper on zero-shot LLM evaluation was accepted to Findings of #EMNLP2025! We address a practical problem by analyzing LLM predictions on 316 prediction tasks: developing methods to predict LLM performance without labeled data. arxiv.org/pdf/2509.15356

Kenny Peng (@kennylpeng)'s Twitter Profile Photo

Being Divya's labmate (and fellow ferry commuter) has been a real pleasure, and I've learned a ton from both her research itself and her approach to research (and also from the other random things she knows about).