Zhuyun Dai (@zhuyundai)'s Twitter Profile
Zhuyun Dai

@zhuyundai

Research Scientist at Google DeepMind. LLM, retrieval, NLP. she/her

ID: 2859456143

Joined: 03-11-2014 20:16:18

94 Tweets

980 Followers

382 Following

Zhuyun Dai (@zhuyundai):

Check out our new work, RARR: Attributed Text Generation via Post-hoc Research and Revision arxiv.org/abs/2210.08726 When applied to the output of LLMs on a diverse set of generation tasks, RARR significantly improves attribution while otherwise preserving the original input.
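
The paper frames this as a two-stage pipeline: a research stage that generates queries and retrieves evidence, and a revision stage that minimally edits unsupported claims. A runnable sketch of that loop (illustrative only; each helper below is a trivial stand-in for the LLM or search component the real system uses):

```python
def generate_queries(passage: str) -> list[str]:
    # Stand-in: the real system prompts an LLM to ask questions about each claim.
    return [s for s in passage.split(". ") if s]

def search(query: str) -> list[str]:
    # Stand-in: the real system calls a search API and returns evidence snippets.
    return [f"evidence for: {query}"]

def agrees(text: str, evidence: str) -> bool:
    # Stand-in: the real system runs an LLM agreement check of claim vs. evidence.
    return True

def revise(text: str, evidence: str) -> str:
    # Stand-in: the real system prompts an LLM for a minimal correcting edit.
    return text

def rarr(passage: str) -> tuple[str, list[str]]:
    """Research stage gathers evidence; revision stage edits unsupported claims."""
    evidence = [e for q in generate_queries(passage) for e in search(q)]
    revised = passage
    for doc in evidence:
        if not agrees(revised, doc):
            revised = revise(revised, doc)
    return revised, evidence  # revised text plus its attribution report
```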

Quoc Le (@quocleix):

New open-source language model from Google AI: Flan-T5 🍮

Flan-T5 is instruction-finetuned on 1,800+ language tasks, leading to dramatically improved prompting and multi-step reasoning abilities.

Public models: bit.ly/3sbNPDJ
Paper: arxiv.org/abs/2210.11416
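
The checkpoints are public, so trying the model takes a few lines; a minimal sketch assuming the Hugging Face `transformers` library and the `google/flan-t5-base` checkpoint:

```python
from transformers import AutoTokenizer, AutoModelForSeq2SeqLM

tokenizer = AutoTokenizer.from_pretrained("google/flan-t5-base")
model = AutoModelForSeq2SeqLM.from_pretrained("google/flan-t5-base")

# Plain instruction-style prompt; no task-specific finetuning needed.
inputs = tokenizer("Answer the following question. What is the capital of France?",
                   return_tensors="pt")
outputs = model.generate(**inputs, max_new_tokens=20)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```
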
Vincent Y. Zhao (@zyzzhaoyuzhe):

New from Google Research! We advance multi-vector neural retrieval with AligneR, which sparsely aligns query and doc tokens. AligneR can adapt alignment for new tasks using just 8 examples, advancing SOTA while running 10x faster than prior multi-vector models.
arxiv.org/abs/2211.01267
Omar Khattab (@lateinteraction):

This is an *amazing* way to re-engineer the scoring mechanism of late interaction / ColBERT retrievers! Instead of gathering all vectors in each retrieved document, they approximate missing vector scores via an upper bound per query token—and modify the objective fn accordingly.
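
In late-interaction scoring, a document's score sums, over query tokens, the max similarity to any document token; the catch is that a document retrieved for one query token has no scores for the others. A toy illustration of the upper-bound fix described here (made-up numbers and a hypothetical `score_docs` helper, not the paper's implementation):

```python
def score_docs(retrieved: dict[int, dict[str, float]]) -> dict[str, float]:
    """retrieved[q] maps doc_id -> that doc's best similarity for query token q,
    over only the documents that token q's search actually returned."""
    doc_ids = {d for hits in retrieved.values() for d in hits}
    scores = {}
    for d in doc_ids:
        total = 0.0
        for hits in retrieved.values():
            # If doc d was never retrieved for this query token, approximate
            # the missing MaxSim with an upper bound: the lowest retrieved score.
            total += hits.get(d, min(hits.values()))
        scores[d] = total
    return scores

# Two query tokens; doc "b" was only retrieved for token 0.
print(score_docs({0: {"a": 0.9, "b": 0.7}, 1: {"a": 0.8}}))  # a: 1.7, b: 1.5
```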

taolei (@taolei15949106):

Introducing Conditional Adapters (CoDA) from Google Research!
Adaptation methods (e.g. Adapter and LoRA) can finetune LMs with minimal parameter updates, but their inference remains expensive. CoDA makes LMs faster to use, and works for three modalities!

arxiv.org/abs/2304.04947
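
The core trick is conditional computation: a router picks a few tokens for the heavy pretrained layer, while every token takes a cheap parallel adapter path. A simplified PyTorch sketch of that idea (my reading of the mechanism, not the paper's code; the real method uses a soft top-k router and frozen pretrained blocks):

```python
import torch
import torch.nn as nn

class ConditionalAdapterLayer(nn.Module):
    """Only the k router-selected tokens pass through the heavy block."""

    def __init__(self, heavy_block: nn.Module, d_model: int, d_adapter: int, k: int):
        super().__init__()
        self.heavy = heavy_block              # stands in for a frozen pretrained layer
        self.router = nn.Linear(d_model, 1)   # scores each token for selection
        self.adapter = nn.Sequential(         # cheap path applied to all tokens
            nn.Linear(d_model, d_adapter), nn.ReLU(), nn.Linear(d_adapter, d_model))
        self.k = k

    def forward(self, x: torch.Tensor) -> torch.Tensor:   # x: [batch, seq, d_model]
        out = x + self.adapter(x)                          # light path, every token
        idx = self.router(x).squeeze(-1).topk(self.k, dim=1).indices
        gather_idx = idx.unsqueeze(-1).expand(-1, -1, x.size(-1))
        heavy_out = self.heavy(torch.gather(x, 1, gather_idx))  # heavy path, k tokens
        return out.scatter_add(1, gather_idx, heavy_out)   # merge heavy results back

layer = ConditionalAdapterLayer(nn.Linear(16, 16), d_model=16, d_adapter=4, k=2)
print(layer(torch.randn(1, 8, 16)).shape)  # torch.Size([1, 8, 16])
```
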
Google DeepMind (@googledeepmind):

We’re proud to announce that DeepMind and the Brain team from Google Research will become a new unit: 𝗚𝗼𝗼𝗴𝗹𝗲 𝗗𝗲𝗲𝗽𝗠𝗶𝗻𝗱. Together, we'll accelerate progress towards a world where AI can help solve the biggest challenges facing humanity. → dpmd.ai/google-deepmind

Jimmy Lin (@lintool):

Just how good are commercially available embedding APIs for vector search? An effort led by Ehsan Kamalloo evaluated a few of them - OpenAI, @CohereAI, and Aleph Alpha - on BEIR and MIRACL... Check out the results! arxiv.org/abs/2305.06300 - forthcoming #ACL2023 industry track paper

Zhuyun Dai (@zhuyundai):

So excited to see our recent work launched on Google Bard (bard.google.com)! The "Google it" button double-checks claims made by the LLM, provides relevant sources, and finds "hallucinated" information.

Jeremy R Cole (@jeremy_r_cole):

I'm in Singapore for EMNLP! I'll be presenting our work "Selectively Answering Ambiguous Questions." arxiv.org/abs/2305.14613 Our goal here was to try to decouple uncertainty about the question from uncertainty about the answer, using a selective question answering approach.
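
A simple version of the selective-answering recipe (a sketch of the general sampling-based approach, not necessarily the paper's exact calibration method): sample several answers and abstain when they disagree, since disagreement can come from either an ambiguous question or an uncertain answer.

```python
from collections import Counter

def selective_answer(question, sample_answer, n=10, threshold=0.7):
    """Sample n answers (temperature > 0); answer only if the modal
    answer's frequency clears the threshold, otherwise abstain."""
    samples = [sample_answer(question) for _ in range(n)]
    answer, count = Counter(samples).most_common(1)[0]
    confidence = count / n
    return (answer, confidence) if confidence >= threshold else (None, confidence)

# Stub sampler for illustration; a real one would call an LLM.
print(selective_answer("Capital of France?", lambda q: "Paris"))  # ('Paris', 1.0)
```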

Jeff Dean (@jeffdean):

Gemini 1.5 Pro - A highly capable multimodal model with a 10M token context length

Today we are releasing the first demonstrations of the capabilities of the Gemini 1.5 series, with the Gemini 1.5 Pro model. One of the key differentiators of this model is its incredibly long…
Zhuyun Dai (@zhuyundai):

I’m pleased to announce Gecko 🦎, a new text embedding model developed at Google DeepMind and now available on Google Cloud! Gecko is powered by an LLM distillation recipe, and is one step towards our goal to bridge LLMs and retrievers. Promptagator 🐊, Gecko 🦎, what’s next?
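
On Google Cloud the model is exposed through the Vertex AI embeddings API; a minimal sketch (the `textembedding-gecko@003` model ID and the project settings are assumptions to check against the current Vertex AI docs):

```python
import vertexai
from vertexai.language_models import TextEmbeddingModel

vertexai.init(project="my-project", location="us-central1")  # your GCP project
model = TextEmbeddingModel.from_pretrained("textembedding-gecko@003")
embeddings = model.get_embeddings(["What is a text embedding model?"])
print(len(embeddings[0].values))  # embedding dimensionality
```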

Aran Komatsuzaki (@arankomatsuzaki):

Google presents Can Long-Context Language Models Subsume Retrieval, RAG, SQL, and More?

Long-context LM:
- Often rivals SotA retrieval and RAG systems
- But still struggles with areas like compositional reasoning

repo: github.com/google-deepmin…
abs: arxiv.org/abs/2406.13121
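
The long-context alternative to a retrieval pipeline is roughly "corpus-in-context" prompting: serialize the whole (small enough) corpus into one prompt and ask the model directly. A sketch of the prompt construction (illustrative; a real prompt would add task instructions and worked examples):

```python
def corpus_in_context_prompt(corpus: list[str], question: str) -> str:
    """Pack the entire corpus into a single long prompt with document ids."""
    docs = "\n".join(f"[{i}] {doc}" for i, doc in enumerate(corpus))
    return ("You are given a corpus of documents.\n"
            f"{docs}\n\n"
            f"Question: {question}\n"
            "Answer using only the documents above, citing document ids.")

# With a million-token context window, a sizable corpus fits in one call.
print(corpus_in_context_prompt(
    ["Gecko is a text embedding model.", "RARR revises text for attribution."],
    "Which model produces embeddings?"))
```
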
Sebastian Riedel (@riedelcastro):

"just put the corpus into the context"! Long context models can already match or beat various bespoke pipelines and infra in accuracy on non-trivial tasks! Hadn't expected this so soon, and honestly was hoping to milk RAG impact for a little longer 🤪