merve (@mervenoyann) 's Twitter Profile
merve

@mervenoyann

open-sourceress at @huggingface 🧙🏻‍♀️ proud Mediterrenean 🍋 I work on zero-shot vision & VLMs

ID: 1202267633049100291

linkhttp://huggingface.co/merve calendar_today04-12-2019 16:45:25

24,24K Tweet

60,60K Followers

4,4K Following

Benjamin Clavié (@bclavie) 's Twitter Profile Photo

RAG is increasingly going multi-modal, but document retrieval is tough, and layout gets in your way. But it shouldn't! Introducing 🪤RAGatouille's Vision-equipped, ColPali-powered sibling: 🐭Byaldi With just a few lines of code, search through documents, with no pre-processing.

RAG is increasingly going multi-modal, but document retrieval is tough, and layout gets in your way. But it shouldn't!

Introducing 🪤RAGatouille's Vision-equipped, ColPali-powered sibling: 🐭Byaldi

With just a few lines of code, search through documents, with no pre-processing.
Niels Rogge (@nielsrogge) 's Twitter Profile Photo

New model alert! 🔥LLaVa-OneVision is now in the Transformers library A powerful series (0.5B/7B/72/B) for single, multi-image, and video scenarios. Successor of LLaVa-NeXT. SOTA open model on Video-MME: video-mme.github.io/home_page.html… Definitely worth a look besides Qwen2-VL 1/2

New model alert! 🔥LLaVa-OneVision is now in the Transformers library

A powerful series (0.5B/7B/72/B) for single, multi-image, and video scenarios. Successor of LLaVa-NeXT. 

SOTA open model on Video-MME: video-mme.github.io/home_page.html…

Definitely worth a look besides Qwen2-VL

1/2
merve (@mervenoyann) 's Twitter Profile Photo

I really want my own local Qwen-2-VL, there goes my weekend to benchmark different quantization methods on different checkpoints 🤝🏻 will likely post my journey here, wish me good luck 🍀

Jo Kristian Bergum (@jobergum) 's Twitter Profile Photo

A new Vespa + ColPali notebook just dropped! We demonstrate how to scale ColPali (and MaxSim) to large collections of PDF pages. - HNSW index over binary patch embeddings - Efficient candidate retrieval over HNSW - Set of re-ranking steps from coarse to finer precision

A new Vespa + ColPali notebook just dropped!

We demonstrate how to scale ColPali (and MaxSim) to large collections of PDF pages. 

- HNSW index over binary patch embeddings
- Efficient candidate retrieval over HNSW
- Set of re-ranking steps from coarse to finer precision
Jason Ramapuram (@jramapuram) 's Twitter Profile Photo

Enjoy attention? Want to make it ~18% faster? Try out Sigmoid Attention. We replace the traditional softmax in attention with a sigmoid and a constant (not learned) scalar bias based on the sequence length. Paper: arxiv.org/abs/2409.04431 Code: github.com/apple/ml-sigmo… This was

Enjoy attention? Want to make it ~18% faster? Try out Sigmoid Attention. We replace the traditional softmax in attention with a sigmoid and a constant (not learned) scalar bias based on the sequence length.

Paper: arxiv.org/abs/2409.04431
Code: github.com/apple/ml-sigmo…

This was