WAIL: ML at UW (@uw_wail) Twitter Tweets • TwiCopy

Samuel "curry-howard fanboi" Ainsworth

4 years ago

Welp, it finally happened... I (successfully) defended my PhD thesis and today I submitted the final revisions to my dissertation! 🎓 Officially a PhD graduate of University of Washington Allen School WAIL: ML at UW!

thumb_up_off_alt103

chat_bubble_outline13

repeat3

shareShare

Tim Dettmers

@tim_dettmers

4 years ago

We release LLM.int8(), the first 8-bit inference method that saves 2x memory and does not degrade performance for 175B models by exploiting emergent properties. Read More: Paper: arxiv.org/abs/2208.07339 Software: huggingface.co/blog/hf-bitsan… Emergence: timdettmers.com/2022/08/17/llm…

thumb_up_off_alt1,1K

chat_bubble_outline17

repeat260

shareShare

WAIL: ML at UW

@uw_wail

4 years ago

my boi Ofir Press is KILLING it out here!! ya'lls position embeddings never stood a chance

thumb_up_off_alt2

chat_bubble_outline0

repeat0

shareShare

Sarah Pratt

@sarahmhpratt

4 years ago

Instead of prompting CLIP w/ "A photo of a {class}", why not ask GPT-3 to describe {class} instead? The result is customized prompts for each {class}! Less human effort, higher zero-shot accuracy Work w/ Rosanne Liu (Rosanne Liu) + Ali Farhadi arxiv.org/abs/2209.03320 (1/4)

thumb_up_off_alt417

chat_bubble_outline9

repeat67

shareShare

Samuel "curry-howard fanboi" Ainsworth

@samuelainsworth

4 years ago

📜🚨📜🚨 NN loss landscapes are full of permutation symmetries, ie. swap any 2 units in a hidden layer. What does this mean for SGD? Is this practically useful? For the past 5 yrs these Qs have fascinated me. Today, I am ready to announce "Git Re-Basin"! arxiv.org/abs/2209.04836

thumb_up_off_alt2,2K

chat_bubble_outline61

repeat571

shareShare

Ofir Press

@ofirpress

3 years ago

We've found a new way to prompt language models that improves their ability to answer complex questions Our Self-ask prompt first has the model ask and answer simpler subquestions. This structure makes it easy to integrate Google Search into an LM. Watch our demo with GPT-3 🧵⬇️

thumb_up_off_alt1,1K

chat_bubble_outline49

repeat302

shareShare

Raghav Somani

@somaniraghav

3 years ago

Observation: Permutation symmetry in deep Neural Networks. Permute neurons in any layer & output stays the same. Question: Does this symmetry help our understanding and analysis of optimization algorithms as the size of the network grows? Is there a scaling limit? A 🧵...

thumb_up_off_alt22

chat_bubble_outline1

repeat4

shareShare

Christoforos Mavrogiannis

@mavrojean

3 years ago

Super excited to share that I'll be joining the Michigan Robotics Department University of Michigan as an Assistant Professor in Fall 2023! My lab will focus on building interactive autonomy for #robots working with and around people. Spread the word for interested students and collaborators 😎

Super excited to share that I'll be joining the <a href="/UMRobotics/">Michigan Robotics</a> Department <a href="/UMich/">University of Michigan</a> as an Assistant Professor in Fall 2023! My lab will focus on building interactive autonomy for #robots working with and around people. Spread the word for interested students and collaborators 😎

thumb_up_off_alt226

chat_bubble_outline25

repeat15

shareShare

Inna Lin

@iwylin

3 years ago

Super excited that our paper "Gendered Mental Health Stigma in Masked Language Models" was accepted to #EMNLP!! 🥳Pre-print coming soon. Joint work with Lucille Njoo (my amazing co-first author), Anjalie Field, Ashish Sharma, Katharina Reinecke, Tim Althoff, & Yulia Tsvetkov💜

thumb_up_off_alt85

chat_bubble_outline3

repeat6

shareShare

Tim Dettmers

@tim_dettmers

3 years ago

Release 0.35 of bitsandbytes brings CUDA 11.8 to the library, making it more straightforward to fine-tune #stablediffusion Dreambooth on 12 GB Colab! At this point, bnb has been pip installed more than 100k times. Thanks for all your support and bug reports!

thumb_up_off_alt83

chat_bubble_outline1

repeat11

shareShare

Ofir Press

@ofirpress

3 years ago

As language models grow in size they know more, but do they get better at reasoning? To test GPT-3, we generated lots of questions such as "What is the calling code of the birthplace of Adele?". We show that as GPT size grows, it does not improve its compositional abilities🧵⬇️

thumb_up_off_alt566

chat_bubble_outline16

repeat90

shareShare

Samuel "curry-howard fanboi" Ainsworth

@samuelainsworth

3 years ago

New week, new edition of Git Re-Basin! New algorithms, experiments and more! A quick 🧵: arxiv.org/abs/2209.04836

thumb_up_off_alt212

chat_bubble_outline3

repeat29

shareShare

MacArthur Foundation

@macfound

3 years ago

Computer Scientist Yejin Choi uses natural language processing to develop AI systems that can understand language and make inferences about the world. Learn more about the 2022 MacArthur Fellow #MacFellow macfound.org/fellows/class-…

thumb_up_off_alt548

chat_bubble_outline16

repeat79

shareShare

Allen School

@uwcse

3 years ago

#UWAllen UW NLP's Yejin Choi aims to develop #AI with the ability to reason and communicate about the world in physical and abstract terms, like humans can do. As a 2022 #MacFellow, she looks forward to taking the “adventurous route” in her research: news.cs.washington.edu/2022/10/12/go-…

#UWAllen <a href="/uwnlp/">UW NLP</a>'s <a href="/YejinChoinka/">Yejin Choi</a> aims to develop #AI with the ability to reason and communicate about the world in physical and abstract terms, like humans can do. As a 2022 #MacFellow, she looks forward to taking the “adventurous route” in her research: news.cs.washington.edu/2022/10/12/go-…

thumb_up_off_alt373

chat_bubble_outline8

repeat63

shareShare

Gian Marco Visani

@gmarcovisani

3 years ago

Happy to share that our work on rotation-equivariant representation learning for spherical and 3D data has been accepted at MLSB @ NeurIPS! Excited to come discuss how symmetry-aware DL may help us characterize protein function from structure. Preprint: doi.org/10.1101/2022.0…

Happy to share that our work on rotation-equivariant representation learning for spherical and 3D data has been accepted at <a href="/workshopmlsb/">MLSB @ NeurIPS</a>!

Excited to come discuss how symmetry-aware DL may help us characterize protein function from structure.

Preprint: doi.org/10.1101/2022.0…

thumb_up_off_alt22

chat_bubble_outline0

repeat6

shareShare

Tim Dettmers

@tim_dettmers

3 years ago

Catch my keynote on 8-bit Methods for Efficient Deep Learning, today at 4:35pm, ballroom C. Besides my work on 8-bit, I will also give a sneak peek into my latest project: Bit-level scaling laws for zeroshot inference. An analysis of 35,000 zeroshot experiments #NeurIPS

thumb_up_off_alt63

chat_bubble_outline1

repeat9

shareShare

Samuel "curry-howard fanboi" Ainsworth

@samuelainsworth

3 years ago

ok so diffusion models are taking over the world... Tune in twitch.tv/skainswo tomorrow 12/3 @ 2pm PST to join me in implementing one in #JAX!

thumb_up_off_alt14

chat_bubble_outline1

repeat1

shareShare

Melanie Sclar

@melaniesclar

3 years ago

LLMs lack robust theory of mind skills, but there are no diverse large-scale datasets for direct training. How can we overcome this? Meet SymbolicToM: a plug-and-play method to boost theory of mind reasoning in language models using explicit graphical representations!✨ #ACL2023

thumb_up_off_alt194

chat_bubble_outline6

repeat48

shareShare

Krishna Pillutla

@krishnapillutla

2 years ago

I’m thrilled to announce that I'll be joining IIT Madras as an Assistant Professor in April 2024! I’m immensely grateful to my amazing mentors, family, and friends for their unwavering support. (1/4)

thumb_up_off_alt1,1K

chat_bubble_outline49

repeat56

shareShare

Samuel "curry-howard fanboi" Ainsworth

@samuelainsworth

2 years ago

Announcing bitbop.io! Run `ssh bitbop.io`, get your own GPU dev machine 🤖

thumb_up_off_alt88

chat_bubble_outline5

repeat18

shareShare