Yutong (Kelly) He (@electronickale)'s Twitter Profile
Yutong (Kelly) He

@electronickale

PhD student @mldcmu, I’m so delusional that doing generative modeling is my job

ID: 1375541940867887104

Link: https://kellyyutonghe.github.io · Joined: 26-03-2021 20:15:39

86 Tweets

819 Followers

382 Following

Jian Ma (@jmuiuc):

A troubling incident unfolded at #NeurIPS2024, where a keynote speaker used a slide that perpetuated harmful stereotypes and racial biases against Chinese students and researchers. I wasn't attending the conference, but I watched the talk recording and followed this closely. 1/7

Dylan Sam (@dylanjsam):

To trust LLMs in deployment (e.g., agentic frameworks or for generating synthetic data), we should predict how well they will perform. Our paper shows that we can do this by simply asking black-box models multiple follow-up questions! w/ Marc Finzi and Zico Kolter 1/ 🧵

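A minimal sketch of the idea as described in the tweet (not necessarily the paper's exact recipe): probe a black-box model with follow-up questions and turn the replies into features for predicting whether the original answer is reliable. The `ask` callable and the probe wording are hypothetical placeholders for whatever chat API is used.

```python
from typing import Callable, List

def followup_features(ask: Callable[[List[dict]], str],
                      question: str,
                      followups: List[str]) -> List[float]:
    """Turn yes/no follow-up probes into simple 0/1 features."""
    history = [{"role": "user", "content": question}]
    answer = ask(history)                       # model's original answer
    history.append({"role": "assistant", "content": answer})

    feats = []
    for probe in followups:
        reply = ask(history + [{"role": "user", "content": probe}])
        feats.append(1.0 if reply.strip().lower().startswith("yes") else 0.0)
    return feats

# Illustrative probes (made up for this sketch):
PROBES = [
    "Are you confident in your previous answer? Answer yes or no.",
    "Would you give the same answer if asked again? Answer yes or no.",
    "Could any assumption behind your answer be wrong? Answer yes or no.",
]
# A small labeled set could then fit, e.g., a logistic-regression model from
# these features to the probability that the original answer is correct.
```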
Samuel Sokota (@ssokota):

Model-free deep RL algorithms like NFSP, PSRO, ESCHER, & R-NaD are tailor-made for games with hidden information (e.g. poker). We performed the largest-ever comparison of these algorithms. We find that they do not outperform generic policy gradient methods, such as PPO. 1/N

Dylan Sam (@dylanjsam):

Excited to share new work from my internship at Google AI! Curious how we should measure the similarity between examples in pretraining datasets? We study the role of similarity in pretraining 1.7B-parameter language models on the Pile. arxiv: arxiv.org/abs/2502.02494 1/🧵

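For a concrete sense of what "similarity between examples" can mean, here is one simple, generic measure (character n-gram Jaccard overlap); the paper may study different or additional notions, so treat this purely as an illustration.

```python
def ngram_set(text: str, n: int = 3) -> set:
    """Character n-grams of a lowercased string."""
    s = text.lower()
    return {s[i:i + n] for i in range(max(len(s) - n + 1, 1))}

def jaccard_similarity(a: str, b: str, n: int = 3) -> float:
    """Jaccard overlap between the n-gram sets of two examples."""
    A, B = ngram_set(a, n), ngram_set(b, n)
    return len(A & B) / len(A | B) if A | B else 0.0

print(jaccard_similarity("the cat sat on the mat", "a cat sat on a mat"))
```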
Fahim Tajwar (@fahimtajwar10):

Interacting with the external world and reacting based on outcomes are crucial capabilities of agentic systems, but existing LLMs’ ability to do so is limited. Introducing Paprika 🌶️, our work on making LLMs general decision makers that can solve new tasks zero-shot. 🧵 1/n

Yutong (Kelly) He (@electronickale):

Dear program chairs of all conferences, please don’t put a 5000-character limit on our rebuttal response, especially when the reviewers get more than ten 7500-character text boxes to write their reviews, thank you so much

Fahim Tajwar (@fahimtajwar10):

RL with verifiable reward has shown impressive results in improving LLM reasoning, but what can we do when we do not have ground truth answers? Introducing Self-Rewarding Training (SRT): where language models provide their own reward for RL training! 🧵 1/n

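One plausible reading of "models provide their own reward" is to reward agreement among a model's own sampled answers when no ground truth exists; whether this matches SRT exactly is not stated in the tweet, so the toy sketch below is only an illustration of that general idea.

```python
from collections import Counter
from typing import List

def self_consistency_rewards(answers: List[str]) -> List[float]:
    """Reward each sampled answer by whether it matches the majority answer."""
    counts = Counter(a.strip() for a in answers)
    majority, _ = counts.most_common(1)[0]
    return [1.0 if a.strip() == majority else 0.0 for a in answers]

samples = ["42", "42", "41", "42"]          # e.g., 4 sampled completions
print(self_consistency_rewards(samples))    # [1.0, 1.0, 0.0, 1.0]
# Such rewards could then be plugged into a standard RL objective (e.g., PPO-style training).
```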
Ricky T. Q. Chen (@rickytqchen):

Padding in our non-AR sequence models? Yuck. 🙅 👉 Instead of unmasking, our new work *Edit Flows* performs iterative refinements via position-relative inserts and deletes, operations naturally suited for variable-length sequence generation. Easily better than using mask tokens.
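To make the edit operations themselves concrete (this sketches only the operations, not the Edit Flows model or how it proposes them): a sequence is refined by applying inserts and deletes at positions, so its length can grow or shrink without any padding or mask tokens.

```python
from typing import List, Tuple

def apply_edits(tokens: List[str],
                inserts: List[Tuple[int, str]],
                deletes: List[int]) -> List[str]:
    """Apply deletes (by index), then inserts of (position, token)."""
    kept = [t for i, t in enumerate(tokens) if i not in set(deletes)]
    # Apply inserts right-to-left so earlier insertion positions stay valid.
    for pos, tok in sorted(inserts, reverse=True):
        kept.insert(pos, tok)
    return kept

seq = ["the", "cat", "sat"]
print(apply_edits(seq, inserts=[(3, "down")], deletes=[1]))
# ['the', 'sat', 'down']  -- length changed with no padding/mask tokens involved
```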