merve (@mervenoyann) Twitter Tweets • TwiCopy

merve

@mervenoyann

+ Follow

open-sourceress at @huggingface 🧙🏻‍♀️ proud Mediterrenean 🍋 I work on zero-shot vision & VLMs

ID: 1202267633049100291

linkhttp://huggingface.co/merve calendar_today04-12-2019 16:45:25

24,24K Tweet

60,60K Followers

4,4K Following

Benjamin Clavié

@bclavie

18 days ago

RAG is increasingly going multi-modal, but document retrieval is tough, and layout gets in your way. But it shouldn't! Introducing 🪤RAGatouille's Vision-equipped, ColPali-powered sibling: 🐭Byaldi With just a few lines of code, search through documents, with no pre-processing.

thumb_up_off_alt690

chat_bubble_outline19

repeat106

shareShare

Junyang Lin

@justinlin610

18 days ago

💗💗💗

thumb_up_off_alt20

chat_bubble_outline1

repeat3

shareShare

Niels Rogge

@nielsrogge

18 days ago

New model alert! 🔥LLaVa-OneVision is now in the Transformers library A powerful series (0.5B/7B/72/B) for single, multi-image, and video scenarios. Successor of LLaVa-NeXT. SOTA open model on Video-MME: video-mme.github.io/home_page.html… Definitely worth a look besides Qwen2-VL 1/2

xjdr

@_xjdr

18 days ago

These should probably be the foundation of your new multimodal RAG stack

thumb_up_off_alt39

chat_bubble_outline0

repeat4

shareShare

merve

@mervenoyann

17 days ago

I really want my own local Qwen-2-VL, there goes my weekend to benchmark different quantization methods on different checkpoints 🤝🏻 will likely post my journey here, wish me good luck 🍀

thumb_up_off_alt253

chat_bubble_outline19

repeat8

shareShare

Jo Kristian Bergum

@jobergum

16 days ago

A new Vespa + ColPali notebook just dropped! We demonstrate how to scale ColPali (and MaxSim) to large collections of PDF pages. - HNSW index over binary patch embeddings - Efficient candidate retrieval over HNSW - Set of re-ranking steps from coarse to finer precision

merve

@mervenoyann

16 days ago

finally the last notebook bender Niels Rogge is trying my notebook and not the other way around we made it fam

thumb_up_off_alt86

chat_bubble_outline5

repeat4

shareShare

Jason Ramapuram

@jramapuram

14 days ago

Enjoy attention? Want to make it ~18% faster? Try out Sigmoid Attention. We replace the traditional softmax in attention with a sigmoid and a constant (not learned) scalar bias based on the sequence length. Paper: arxiv.org/abs/2409.04431 Code: github.com/apple/ml-sigmo… This was

thumb_up_off_alt658

chat_bubble_outline13

repeat127

shareShare