Dimitris Tsipras (@tsiprasd)'s Twitter Profile
Dimitris Tsipras

@tsiprasd

ID: 982200698

Link: https://dtsipras.com · Joined: 01-12-2012 09:37:51

174 Tweets

2.2K Followers

140 Following

Joon Sung Park (@joon_s_pk)'s Twitter Profile Photo

How might an online community look after many people join? My paper w/ Lindsay Popowski @Carryveggies Meredith Ringel Morris Percy Liang Michael Bernstein introduces "social simulacra": a method of generating compelling social behaviors to prototype social designs 🧵 arxiv.org/abs/2208.04024 #uist2022

Xiang Lisa Li (@xianglisali2)'s Twitter Profile Photo

arxiv.org/abs/2210.15097 We propose contrastive decoding (CD), a more reliable search objective for text generation by contrasting LMs of different sizes. CD takes a large LM (expert LM e.g. OPT-13b) and a small LM (amateur LM e.g. OPT-125m) and maximizes their logprob difference

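As a hedged sketch of the idea in this tweet: score each candidate next token by the gap between the expert LM's and the amateur LM's log-probabilities and decode greedily. The model names, the `alpha` plausibility cutoff, and the greedy loop below are illustrative choices, not the authors' released implementation.

```python
# Minimal sketch of contrastive decoding as described above: pick the next
# token that maximizes log p_expert(x) - log p_amateur(x), restricted to
# tokens the expert itself finds plausible. Illustrative only.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

tok = AutoTokenizer.from_pretrained("facebook/opt-13b")   # OPT models share a tokenizer
expert = AutoModelForCausalLM.from_pretrained("facebook/opt-13b")
amateur = AutoModelForCausalLM.from_pretrained("facebook/opt-125m")

@torch.no_grad()
def contrastive_decode(prompt, max_new_tokens=50, alpha=0.1):
    ids = tok(prompt, return_tensors="pt").input_ids
    for _ in range(max_new_tokens):
        log_p_exp = expert(ids).logits[0, -1].log_softmax(-1)
        log_p_ama = amateur(ids).logits[0, -1].log_softmax(-1)
        # Simple plausibility cutoff (a stand-in for the paper's constraint):
        # only consider tokens within a factor alpha of the expert's best token.
        plausible = log_p_exp >= log_p_exp.max() + torch.log(torch.tensor(alpha))
        # Maximize the expert-minus-amateur log-probability difference.
        scores = (log_p_exp - log_p_ama).masked_fill(~plausible, float("-inf"))
        next_id = scores.argmax().view(1, 1)
        ids = torch.cat([ids, next_id], dim=-1)
    return tok.decode(ids[0], skip_special_tokens=True)
```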
Percy Liang (@percyliang)'s Twitter Profile Photo

Language models are becoming the foundation of language technologies, but when do they work and when do they fail? In a new CRFM paper, we propose Holistic Evaluation of Language Models (HELM), a framework to increase the transparency of LMs. Holistic evaluation includes three elements:

Aleksander Madry (@aleks_madry)'s Twitter Profile Photo

You’re deploying an ML system, choosing between two models trained w/ diff algs. Same training data, same acc... how do you differentiate their behavior? ModelDiff (gradientscience.org/modeldiff) lets you compare *any* two learning algs! w/ Harshay Shah Sam Park Andrew Ilyas (1/8)

Aleksander Madry (@aleks_madry)'s Twitter Profile Photo

Stable diffusion can visualize + improve model failure modes! Leveraging our method, we can generate examples of hard subpopulations, which can then be used for targeted data augmentation to improve reliability. Blog: gradientscience.org/failure-direct… Saachi Jain Hannah Lawrence A.Moitra

Percy Liang (@percyliang)'s Twitter Profile Photo

📣 CRFM announces PubMedGPT, a new 2.7B language model that achieves a new SOTA on the US medical licensing exam. The recipe is simple: a standard Transformer trained from scratch on PubMed (from The Pile) using @mosaicml on the MosaicML Cloud, then fine-tuned for the QA task.

Aleksander Madry (@aleks_madry)'s Twitter Profile Photo

Recent events (ahem) have brought the debate on whether/how to regulate social media back to the forefront. My students Sarah Cen Andrew Ilyas and I have been thinking about this for a *while*. Excited to share the first results of our thinking: aipolicy.substack.com/p/socialmedias… (1/3)

Percy Liang (@percyliang)'s Twitter Profile Photo

Announcing Holistic Evaluation of Language Models (HELM) v0.2.0 with updated results on the new OpenAI, AI21 Labs, and @CohereAI models. HELM now evaluates 34 prominent language models in a standardized way on 42 scenarios x 7 metrics.

Dimitris Papailiopoulos (@dimitrispapail)'s Twitter Profile Photo

Can transformers follow instructions? We explore this in: "Looped Transformers as Programmable Computers" arxiv.org/abs/2301.13196 led by Angeliki (Angeliki Giannou) and Shashank (Shashank Rajput) in collaboration with Kangwook Lee and Jason Lee. Here is a 🧵

John Hewitt (@johnhewtt)'s Twitter Profile Photo

For this year's CS 224n: Natural Language Processing with Deep Learning, I've written notes on our Self-Attention and Transformers lecture. web.stanford.edu/class/cs224n/r… Topics: Problems with RNNs, then self-attention, then a 'minimal' self-attention architecture, then Transformers.

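For readers of those notes, here is one way a "minimal" single-head self-attention layer can look in PyTorch; the class name, dimensions, and the absence of masking and multiple heads are simplifications for illustration, not the course's exact formulation.

```python
# Minimal single-head self-attention: softmax(QK^T / sqrt(d)) V
import torch
import torch.nn as nn

class SelfAttention(nn.Module):
    def __init__(self, d_model):
        super().__init__()
        self.q = nn.Linear(d_model, d_model)
        self.k = nn.Linear(d_model, d_model)
        self.v = nn.Linear(d_model, d_model)

    def forward(self, x):                          # x: (batch, seq_len, d_model)
        q, k, v = self.q(x), self.k(x), self.v(x)
        # Scaled dot-product attention over the sequence dimension.
        scores = q @ k.transpose(-2, -1) / (x.size(-1) ** 0.5)
        return scores.softmax(dim=-1) @ v

x = torch.randn(2, 5, 64)
print(SelfAttention(64)(x).shape)                  # torch.Size([2, 5, 64])
```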
Sang Michael Xie (@sangmichaelxie)'s Twitter Profile Photo

Data selection for LMs (GPT-3, PaLM) is done with heuristics, e.g. training a classifier to pick out high-quality text. Can we do better? Turns out we can boost downstream GLUE accuracy by 2+% by adapting the classic importance resampling algorithm: arxiv.org/abs/2302.03169 🧵

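A rough sketch of what "adapting classic importance resampling to data selection" can look like: weight each raw example by how much more likely it is under a target-domain proxy model than under a raw-pool proxy model, then resample in proportion to those weights. The unigram proxies and helper names below are illustrative simplifications, not the paper's estimator.

```python
# Hedged sketch: importance resampling for data selection.
import math
import random
from collections import Counter

def unigram_logprob(text, counts, total, vocab_size):
    # Add-one smoothed unigram log-probability of a whitespace-tokenized text.
    return sum(math.log((counts[w] + 1) / (total + vocab_size)) for w in text.split())

def select(raw_texts, target_texts, k):
    tgt_counts = Counter(w for t in target_texts for w in t.split())
    raw_counts = Counter(w for t in raw_texts for w in t.split())
    vocab = len(set(tgt_counts) | set(raw_counts))
    tgt_total, raw_total = sum(tgt_counts.values()), sum(raw_counts.values())
    # Importance weight in log space: log p_target(x) - log p_raw(x).
    log_w = [unigram_logprob(t, tgt_counts, tgt_total, vocab)
             - unigram_logprob(t, raw_counts, raw_total, vocab) for t in raw_texts]
    m = max(log_w)                                  # subtract max for numerical stability
    weights = [math.exp(lw - m) for lw in log_w]
    # Resample k examples with probability proportional to their weights.
    return random.choices(raw_texts, weights=weights, k=k)
```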
Tatsunori Hashimoto (@tatsu_hashimoto)'s Twitter Profile Photo

We know that language models (LMs) reflect opinions - from internet pre-training, to developers and crowdworkers, and even user feedback. But whose opinions actually appear in the outputs? We make LMs answer public opinion polls to find out: arxiv.org/abs/2303.17548

OpenAI (@openai)'s Twitter Profile Photo

We have reached an agreement in principle for Sam Altman to return to OpenAI as CEO with a new initial board of Bret Taylor (Chair), Larry Summers, and Adam D'Angelo. We are collaborating to figure out the details. Thank you so much for your patience through this.