Simon Shaolei Du (@SimonShaoleiDu) Twitter Tweets • TwiCopy

Simon Shaolei Du

@SimonShaoleiDu

Assistant Professor @uwcse. Postdoc @the_IAS. PhD in machine learning @mldcmu.

ID: 913981622193664000

Link: http://simonshaoleidu.com
Joined: 30-09-2017 04:19:34

456 Tweets

6.3K Followers

2.0K Following

Sanjeev Arora

@prfsanjeevarora

1 month ago

Big congratulations to Avi Wigderson of IAS Princeton for winning the Turing Award in CS. Truly an all-time great in theoretical computer science and discrete math. Also one of the nicest human beings I know, a friend and mentor to so many (including me). tinyurl.com/fz5vxxaf

484 likes

Jifan Zhang

2 months ago

Our LabelBench work has been accepted to the DMLR journal 🎉 Super smooth experience, and highly recommended for anyone working on data-centric ML.

Check out labeltrain.ai: our broader set of label-efficient learning work
More on LabelBench: x.com/jifan_zhang/st…

19 likes

Eric Topol

3 months ago

🆕 The Lancet
Counterfactual #AI models have exciting potential (and challenges) in medicine and life science
Why? An explainer, by Su-In Lee and me
thelancet.com/journals/lance…

171 likes

Simon Shaolei Du

@SimonShaoleiDu

3 months ago

A TWO-PLAYER system for online RL fine-tuning.

11 likes

Simon Shaolei Du

@SimonShaoleiDu

3 months ago

Honored to become a 2024 #SloanFellow. Thanks to the Sloan Foundation and to all my students and collaborators for their amazing work!

240 likes

Allen School

4 months ago

“The problem with large language models is that they’re large!” In this UW-IT News story, #UWAllen’s Tim Dettmers explains QLoRA, the Madrona prize-winning tool from UW NLP researchers that enables you to take an LLM and “make it your own”: itconnect.uw.edu/making-languag… 1/2

8 likes

Rob Nowak

4 months ago

New paper on label-efficient supervised finetuning of LLMs.

We address the expensive prompt annotation cost by humans/proprietary LLMs, saving as much as 50% on FLAN V2.

Paper: arxiv.org/abs/2401.06692
Work led by: Jifan Zhang Yifang Chen Gantavya Bhatt Arnav Das
1/

149 likes

Simon Shaolei Du

@SimonShaoleiDu

5 months ago

Why Decision Transformer? It doesn't require Bellman completeness, a strong assumption needed by Q-learning.

60 likes

Tim Althoff

5 months ago

I'm recruiting PhD students at the Allen School and UW NLP (bdata.uw.edu). Focus areas include Human-AI collaboration, language agents, LLM safety & applications to mental health, social sciences, education.

Apply here: cs.washington.edu/academics/phd/…

UW Data Science, @UW_iSchool, HCI & Design at UW

288 likes

Simon Shaolei Du

@SimonShaoleiDu

5 months ago

Zihan will present our work on optimal sample complexity for reinforcement learning: arxiv.org/abs/2307.13586

18 likes

Simon Shaolei Du

@SimonShaoleiDu

6 months ago

We proved Q* is hard 4 years ago 🤷‍♂️

86 likes

Allen School

7 months ago

#UWAllen is hiring! Join our outstanding scholarly community at University of Washington shaping the future of computing—and having fun while doing it! Priority will be given to faculty applications received by Nov. 13 (teaching track) & Nov. 15 (tenure track). Please share! cs.washington.edu/faculty_candid…

20 likes

Yuandong Tian

7 months ago

Now Scan&Snap has a follow-up!

1/ We introduce JoMA (arxiv.org/abs/2310.00535), a joint dynamics for MLP lower and self-Attention layers, in order to better understand (1) how multilayer Transformer with MLP nonlinearity works, and (2) qualitatively explain how hierarchical

63 likes

Abhishek Gupta

@abhishekunique7

8 months ago

Want to get model-based RL to work in diverse, dynamic scenes? Check out Chuning Zhu's latest work (RePo) on model-based reinforcement learning without reconstruction, where we show how to learn world models that scale to dynamic, multi-task environments. A 🧵(1/6)

94 likes

Su-In Lee

11 months ago

Super exciting work by Allen School's Sheng Wang's lab published in Nature Machine Intelligence! 👇
nature.com/articles/s4225…

28 likes

Jifan Zhang

11 months ago

Introducing our framework and benchmarks for label-efficient learning.

Evaluations of large pretrained models, Semi-SL and active learning have mostly stayed isolated. LabelBench combines all these mutually beneficial techniques to examine the best possible label-efficiency 1/

67 likes

Simon Shaolei Du

@SimonShaoleiDu

11 months ago

Our attempt to open the black box of 1-layer transformer training dynamics.

36 likes
