Michael Günther (@michael_g_u)'s Twitter Profile
Michael Günther

@michael_g_u

ML @jinaai_

ID: 1561416582869516289

Link: https://github.com/guenthermi
Joined: 21-08-2022 18:15:08

243 Tweets

460 Followers

204 Following

Michael Günther (@michael_g_u):

I went to SIGIR with Bo this year; we wrote a blog post with our highlights and summaries of the AI and neural papers we found interesting at the conference: jina.ai/news/what-we-l…

Jina AI (@jinaai_):

Two weeks ago, we released jina-embeddings-v4-GGUF with dynamic quantizations. During our experiments, we found interesting things while converting and running GGUF embeddings. Since most of the llama.cpp community focuses on LLMs, we thought it'd be valuable to share this from…

Jina AI (@jinaai_):

Got a Mac with an M-chip? You can now train Gemma3 270m locally as a multilingual embedding or reranker model using our mlx-retrieval project. It lets you train Gemma3 270m locally at 4000 tokens/s on M3 Ultra - that's actually usable speed. We've implemented some standard…

Michael Günther (@michael_g_u):

We are at Qdrant's Vector Space Day 🚀 in Berlin on Sep 26. We'll talk about "Vision-Language Models: A New Architecture for Multi-Modal Embedding Models" and also share some insights and learnings we gained while training jina-embeddings-v4. 🎫 lu.ma/p7w9uqtz

Jina AI (@jinaai_):

Today we're releasing jina-code-embeddings, a new suite of code embedding models in two sizes (0.5B and 1.5B parameters), along with 1-4 bit GGUF quantizations of both. Built on the latest code-generation LLMs, these models achieve SOTA retrieval performance despite their compact size.

𝚐𝔪𝟾𝚡𝚡𝟾 (@gm8xx8):

mmBERT: Massively Multilingual BERT

Trained on 3T+ tokens across 1,833 languages, mmBERT surpasses XLM-R on standard NLU and retrieval benchmarks and is competitive with English-only encoders; in throughput tests it runs 2–4× faster than prior multilingual encoders under…
Jina AI (@jinaai_):

V4 is multimodal embeddings, but V4-GGUF wasn't, until now. We've finally cracked how to generate multimodal embeddings using llama.cpp & GGUF. We fixed two main issues. First, in the language model part, we corrected the attention mask in the transformer block so it properly…

Jina AI (@jinaai_):

Last but not late: jina-reranker-v3 is here! A new 0.6B-parameter listwise reranker that puts the query and all candidate documents in one context window and achieves SOTA on BEIR. We call this new query-document interaction "last but not late". It's "last" because <|doc_emb|> is placed as…

Jina AI (@jinaai_):

Heard you like GGUFs and MLX. Our newly released listwise reranker, jina-reranker-v3, is now available in dynamic quantized GGUFs and MLX. Check out our 🤗 collection for the weights and the arXiv report: huggingface.co/collections/ji…

Elastic (@elastic):

We’re excited to announce that we have joined forces with Jina AI, a leader in frontier models for multimodal and multilingual search. This acquisition deepens Elastic’s capabilities in retrieval, embeddings, and context engineering to power agentic AI: go.es.io/48QeYCM

Jacob Springer (@jacspringer):

Does synthetic data always help text-embedder models?
Not quite. The gains are sparse and come with trade-offs.
We open-source data + code to make research on synthetic data for embeddings more rigorous. 1/
tomaarsen (@tomaarsen):

The MTEB team has just released MTEB v2, an upgrade to their evaluation suite for embedding models!

Their blogpost covers all changes, including easier evaluation, multimodal support, rerankers, new interfaces, documentation, dataset statistics, a migration guide, etc.

🧵
Jina AI (@jinaai_):

In 2 weeks, we're presenting at #EMNLP2025 and hosting a BoF on Embeddings, Rerankers, Small LMs for Better Search, again! Come check out our research on training data for multi-hop reasoning, multimodal embeddings, and where retrieval models are headed in 2025/26. Say hi to our…

António Loison (@antonio_loison):

📢 ViDoRe V3, our new multimodal retrieval benchmark for enterprise use cases, is finally here!
It focuses on real-world applied RAG scenarios using high-quality human-verified data. huggingface.co/blog/QuentinJG…
🧵(1/N)