UW RAIVN Lab (@raivnlab) 's Twitter Profile
UW RAIVN Lab

@raivnlab

The computer vision and reasoning lab in the Allen School at the University of Washington, led by Ali Farhadi and Ranjay Krishna.

ID: 1248738274438344704

linkhttps://raivn.cs.washington.edu calendar_today10-04-2020 22:23:10

166 Tweet

818 Followers

143 Following

UW RAIVN Lab (@raivnlab) 's Twitter Profile Photo

New CLIP model achieving 80.1% zero-shot ImageNet top1 from members of RAIVN lab, LAION and StabilityAI. This uses two prior methods from RAIVN: 1. Model soups, @mitchnw et al (arxiv.org/abs/2203.05482) 2. CuPL prompts (for 80.3%), Sarah Pratt et al (arxiv.org/abs/2209.03320)

Mitchell Wortsman (@mitchnw) 's Twitter Profile Photo

Sharing our project on 1) accelerating and 2) stabilizing training for large language-vision models 1) Towards accelerating training, we introduce SwitchBack, a linear layer for int8 quantized training which matches bfloat16 within 0.1 for CLIP ViT-Huge arxiv.org/abs/2304.13013

Sharing our project on 1) accelerating and 2) stabilizing training for large language-vision models

1) Towards accelerating training, we introduce SwitchBack, a linear layer for int8 quantized training which matches bfloat16 within 0.1 for CLIP ViT-Huge

arxiv.org/abs/2304.13013
Gabriel Ilharco (@gabriel_ilharco) 's Twitter Profile Photo

Introducing DataComp, a new benchmark for multimodal datasets! We release 12.8B image-text pairs, 300+ experiments and a 1.4B subset that outcompetes compute-matched CLIP runs from OpenAI & LAION 📜 arxiv.org/abs/2304.14108 🖥️ github.com/mlfoundations/… 🌐 datacomp.ai

Introducing DataComp, a new benchmark for multimodal datasets! 

We release 12.8B image-text pairs, 300+ experiments and a 1.4B subset that outcompetes compute-matched CLIP runs from OpenAI & LAION

📜 arxiv.org/abs/2304.14108
🖥️ github.com/mlfoundations/…
🌐 datacomp.ai
Vaishaal Shankar (@vaishaal) 's Twitter Profile Photo

1/9 I am excited to announce that our workshop "Towards the Next Generation of Computer Vision Datasets" will be happening at ICCV 2023 in Paris. We will feature DataComp submissions, other data-centric papers, and invited talks by experts. datacomp.ai/workshop

Aditya Kusupati (@adityakusupati) 's Twitter Profile Photo

Introducing💃AdANNS: A Framework for Adaptive Semantic Search🕺 TL;DR: Up to 90× faster nearest neighbor retrieval and 2× lower memory cost for web-scale search. Applies to vector search at scale & improves all "retrieval" augmented models! arxiv.org/abs/2305.19435 [1/8]

Introducing💃AdANNS: A Framework for Adaptive Semantic Search🕺

TL;DR: Up to 90× faster nearest neighbor retrieval and 2× lower memory cost for web-scale search.

Applies to vector search at scale & improves all "retrieval" augmented models!

arxiv.org/abs/2305.19435

[1/8]
UW RAIVN Lab (@raivnlab) 's Twitter Profile Photo

If you are at #CVPR2023, come check out prompting-in-vision.github.io on Monday, June 19 from 9am - 12pm in West room 223-224. Speakers include Sarah Pratt from RAIVN lab as well as Ziwei Liu Phillip Isola Hyojin Bahng Ludwig Schmidt and Denny Zhou!

Mitchell Wortsman (@mitchnw) 's Twitter Profile Photo

Sharing some highlights from our work on small-scale proxies for large-scale Transformer training instabilities: arxiv.org/abs/2309.14322 With fantastic collaborators Peter J. Liu, Lechao Xiao, Katie Everett, many others (see final tweet!), Jaehoon Lee, Justin Gilmer, Simon Kornblith! (1/15)

Sharing some highlights from our work on small-scale proxies for large-scale Transformer training instabilities: arxiv.org/abs/2309.14322

With fantastic collaborators <a href="/peterjliu/">Peter J. Liu</a>, <a href="/Locchiu/">Lechao Xiao</a>, <a href="/_katieeverett/">Katie Everett</a>, many others (see final tweet!), <a href="/hoonkp/">Jaehoon Lee</a>, <a href="/jmgilmer/">Justin Gilmer</a>, <a href="/skornblith/">Simon Kornblith</a>!

(1/15)
Mitchell Wortsman (@mitchnw) 's Twitter Profile Photo

E) The attention logit growth instability is still present when replacing softmax with pointwise alternatives. Side note: If you're interested in learning more about replacing softmax with a pointwise alternative like relu^2/√seqlen, checkout arxiv.org/abs/2309.08586! (12/15)

E) The attention logit growth instability is still present when replacing softmax with pointwise alternatives.

Side note: If you're interested in learning more about replacing softmax with a pointwise alternative like relu^2/√seqlen, checkout arxiv.org/abs/2309.08586!

(12/15)
UW RAIVN Lab (@raivnlab) 's Twitter Profile Photo

Check out🪆MatFormer🪆co-led by Aditya Kusupati: it’s a simple yet powerful general-purpose architecture with flexibility and elasticity built within. It works across modalities and enables super cool things at web-scale tasks🔥🔥

Ainaz Eftekhar (@ainaz_eftekhar) 's Twitter Profile Photo

🎉 Very Excited to present our recent work on “Selective🔍 Visual Representations for Embodied-AI🤖” next week at ICLR in Vienna🇦🇹!! 📣📣Important update! Our code and pretrained models are now available through our project website 🌐: embodied-codebook.github.io🚀 👋Come to my