Michael Hu (@michahu8) Twitter Tweets • TwiCopy

Michael Hu

@michahu8

+ Follow

PhD candidate @NYU. NLP, training data, RL.
@NSF GRFP fellow. Previously @princeton_nlp, @cocosci_lab.

ID: 1166406061378723843

linkhttps://michahu.github.io/ calendar_today27-08-2019 17:44:23

95 Tweet

594 Followers

538 Following

Kyunghyun Cho

@kchonyc

9 months ago

it is my great honour to be appointed as the Glen se Vries Professor of Health Statistics. i have quickly written about this in my blog post: kyunghyuncho.me/glen-de-vries-…

thumb_up_off_alt339

chat_bubble_outline32

repeat17

shareShare

!!! I'm at #ICLR2025 to present 🧄Aioli🧄 a unified framework for data mixing on Thursday afternoon! 🔗 arxiv.org/abs/2411.05735 Message me to chat about pre/post training data (mixing, curriculum, understanding); test-time compute/verification; or to try new food 🇸🇬

thumb_up_off_alt155

chat_bubble_outline2

repeat53

shareShare

Michael Hu

@michahu8

6 months ago

hot multi agent researcher summer

thumb_up_off_alt9

chat_bubble_outline0

repeat0

shareShare

Michael Hu

@michahu8

5 months ago

RL can certainly teach LLMs new skills in principle, but in practice token-level exploration is so challenging that we end up relying on pretraining and synthetic data. the era of experience implies the era of exploration

thumb_up_off_alt12

chat_bubble_outline0

repeat1

shareShare

Michael Hu

@michahu8

4 months ago

I'll be at #ACL2025 next week! 🇦🇹 Things on my mind: curriculum learning, online adaptation, LM agents Where to find me: 1⃣ Monday: my team's poster on PeopleJoin (interning at Microsoft) 2⃣ Wednesday: discussing pre-pretraining in Panel 1 Excited to chat! DMs are open 😊

thumb_up_off_alt44

chat_bubble_outline0

repeat6

shareShare

Yuntian Deng

@yuntiandeng

2 months ago

Every time I watch models train, I wish I could tune LR on the fly. It's like cooking: we adjust the dial when the food smells off. We built Interactive Training to do that, turning loss monitoring into interaction. Paper👉huggingface.co/papers/2510.02… Led by Wentao Zhang w/ Yang Lu

thumb_up_off_alt198

chat_bubble_outline5

repeat35

shareShare

Michael Hu

@michahu8

2 months ago

i'm getting really tired of "it's not x, it's y" and "do x, not y" in peoples' writing (more accurately, in their ghostwriter's writing). when i see it i honestly just move on and try to delete what i saw from my memory it's not just sloppy, it's also tasteless

thumb_up_off_alt8

chat_bubble_outline1

repeat0

shareShare

Eric Bigelow

@ericbigelow

25 days ago

📝 New paper! Two strategies have emerged for controlling LLM behavior at inference time: in-context learning (ICL; i.e. prompting) and activation steering. We propose that both can be understood as altering model beliefs, formally in the sense of Bayesian belief updating. 1/9

thumb_up_off_alt120

chat_bubble_outline8

repeat21

shareShare

Sabri Eyuboglu

@eyuboglusabri

16 days ago

Everything I know about data mixing, I've learned from Mayee. The best mixer out there👩‍🍳 Very excited that her insights are now part of Olmo's open recipe

thumb_up_off_alt46

chat_bubble_outline1

repeat2

shareShare

Michael Hu

@michahu8

12 days ago

It's rare to find formidable theory chops, engineering skill, and a taste for important problems in the same researcher, but that's Will! apply apply apply

thumb_up_off_alt9

chat_bubble_outline0

repeat0

shareShare

Michael Hu

Kyunghyun Cho

Mayee Chen

Michael Hu

Michael Hu

Michael Hu

Yuntian Deng

Michael Hu

Eric Bigelow

Sabri Eyuboglu

Michael Hu