Benjamin Trent (@benwtrent)'s Twitter Profile
Benjamin Trent

@benwtrent

Doer of things | Builder of things | software engineer @elastic.

ID: 4870319492

Link: https://benwtrent.github.io | Joined: 06-02-2016 21:03:23

308 Tweets

281 Followers

151 Following

Benjamin Trent (@benwtrent)

I really enjoyed this talk by Mark Harwood. He is back at finding crazy & interesting ways to explore data (I guess he never stopped). Clustering with binary vectors & vector search with Elastic: youtube.com/watch?v=sJU_8m…

Adrien Grand (@jpountz)

You may not have heard of the Terrier IR platform terrier.org but both Lucene's Similarity framework lucene.apache.org/core/10_1_0/co… and Elasticsearch's Retriever framework elastic.co/guide/en/elast… are heavily inspired by Terrier.

Adrien Grand (@jpountz)

Two good speedups on queries sorted by field on #Lucene's nightly benchmarks last night. This is due to a recent optimization to conjunctive queries being generalized to intersecting a query and a collector. github.com/apache/lucene/…

Adrien Grand (@jpountz)

Guo Feng contributed a 2.5x (!) speedup to #Lucene's numeric range queries by using vectorization. HZ sped up query evaluation, ID sped up decoding data from the index. Lots of great performance improvements coming in Lucene 10.2.

Michael McCandless (@mikemccand)

#Apache #Lucene will soon have a faster and smaller terms index! This is a complex part of Lucene, and a major hotspot for terms heavy use cases like (primary) key/value store (~34% speedup, but results are preliminary!). Lucene's pluggable Codec API makes experimentation like

Adrien Grand (@jpountz)

It's time to redo benchmarks! #Lucene 10.2 was just released, with
- huge speedups to non-scoring boolean queries, range queries and filtered vector search,
- better merging defaults for faster search,
- much faster merging of vectors
And more... lucene.apache.org/core/corenews.…

Elastic (@elastic)

Now live! Elastic 9.0/8.18 includes faster quantization, Elastic Distributions of OTel and LLM observability, the GA of Attack Discovery and Automatic Import, major enhancements in ES|QL like JOIN, and more. Learn more → go.es.io/42tjpyA

Costin Leau (@costinl)

🚀 Unlock richer insights in Elasticsearch 8.18/9.0! ES|QL's new LOOKUP JOIN enables powerful query-time joins across indices / datastreams for instant data enrichment elastic.co/search-labs/bl…

Adrien Grand (@jpountz)

I wanted to share what I learned from Tantivy's "Search Benchmark, the Game", so I set up GitHub pages and wrote two blogs, on general observations on the benchmark jpountz.github.io/2025/05/12/ana… and how it helped drive performance improvements in Lucene jpountz.github.io/2025/04/12/why…

Adrien Grand (@jpountz)

There has been a big regression in Lucene's nightly benchmarks recently after a kernel upgrade. Michael McCandless and Robert Charles Muir found that it was caused by a change in the Linux scheduler configuration. github.com/apache/lucene/…

Doug Turnbull (@softwaredoug)

Next in the series of building a search engine from scratch - we focus on hybrid retrieval with Benjamin Trent of Elastic. How do you add filtering to an HNSW index? maven.com/p/430592/hybri…

Doug Turnbull (@softwaredoug)

Several weeks ago, I put the R in RAG with Hamel Husain by discussing hybrid search best practices. Next up we put the F(ilter) in HNSW to build hybrid search. Which doesn't quite fit... but Benjamin Trent and I are not intimidated by such trivialities maven.com/p/430592/hybri…

Adrien Grand (@jpountz)

Someone asked me for my opinion on the Vespa vs. Elasticsearch performance comparison today at Berlin Buzzwords, so I gave it a try: jpountz.github.io/2025/06/17/ana…

Luca Cavanna (@lucacavanna)

I enjoyed Berlin Buzzwords very much; it is always a pleasure to catch up with everyone in person. Here is my talk with Adrien Grand on shipping Lucene 10: youtu.be/GRhzgCEL_ac?si…