Anshum Gupta (@anshumgupta) Twitter Tweets • TwiCopy

More dimensions isn't always better. Just ask yourself, is 10% more signal with increased hallucinations worth a 99% increase in storage cost? #GenAI #vectors #search #lucenia

thumb_up_off_alt3

chat_bubble_outline1

repeat1

shareShare

Adrien Grand

@jpountz

2 years ago

Lucene's nightly benchmarks got a massive speedup on queries sorted by numeric field last night: people.apache.org/~mikemccand/lu…. This is due to this PR: github.com/apache/lucene/….

thumb_up_off_alt22

chat_bubble_outline2

repeat7

shareShare

Community Over Code EU is kicking off today in Bratislava, Slovakia! For anyone who lives in/around Slovakia who is interested in attending, check out our 1-day pass for locals. eu.communityovercode.org/tickets/ #CommunityOverCode #opensource

thumb_up_off_alt11

chat_bubble_outline0

repeat3

shareShare

ShaneCurcuru everywhere on socials 🗳️💙

@shanecurcuru

a year ago

#CommunityOverCode keynote: EU legislative changes panel: "Many of these standards depend on working with people, which is always hard" Code is easy. People are hard. Optimize for people.

thumb_up_off_alt6

chat_bubble_outline1

repeat1

shareShare

Anshum Gupta

@anshumgupta

a year ago

Search track at Community Over Code in Bratislava, atitaarora kicks off the day talking about “Navigating Challenges and Enhancing Performance of LLM based Applications” #communityovercode #lucene #solr #theasf

Search track at Community Over Code in Bratislava, <a href="/atitaarora/">atitaarora</a> kicks off the day talking about “Navigating Challenges and Enhancing Performance of LLM based Applications” #communityovercode #lucene #solr #theasf

thumb_up_off_alt3

chat_bubble_outline0

repeat1

shareShare

Anshum Gupta

@anshumgupta

a year ago

In the follow up talk on the Search track, we have Alessandro Benedetti talk about “Hybrid Search with Apache Solr” #communityovercode #theasf #solr #lucene #llm #search

In the follow up talk on the Search track, we have <a href="/AlexBenedetti/">Alessandro Benedetti</a> talk about “Hybrid Search with Apache Solr” #communityovercode #theasf #solr #lucene #llm #search

thumb_up_off_alt6

chat_bubble_outline0

repeat4

shareShare

Anshum Gupta

@anshumgupta

a year ago

The talk by Angad Sharma talking about Solr at zomato #communityovercode #lucene #solr #zomato

The talk by Angad Sharma talking about Solr at <a href="/zomato/">zomato</a> #communityovercode #lucene #solr #zomato

thumb_up_off_alt2

chat_bubble_outline0

repeat1

shareShare

Anshum Gupta

@anshumgupta

a year ago

Next up is Yupeng Fu from Uber talking about their use of Lucene for Vector search at Uber and Uber eats! #communityovercode #lucene #solr #vectorsearch

thumb_up_off_alt2

chat_bubble_outline1

repeat1

shareShare

ShaneCurcuru everywhere on socials 🗳️💙

@shanecurcuru

a year ago

⁦Dirk-Wiǁem van Gulik⁩ explaining the reasons for the CRA: the Grover Shoe Factory explosion is one example. Fast moving industries move fast and break things - which harms society when steam boilers actually blow up and kills people. #CommunityOverCode

⁦<a href="/dirkx/">Dirk-Wiǁem van Gulik</a>⁩ explaining the reasons for the CRA: the Grover Shoe Factory explosion is one example. Fast moving industries move fast and break things - which harms society when steam boilers actually blow up and kills people. #CommunityOverCode

thumb_up_off_alt2

chat_bubble_outline1

repeat1

shareShare

Anshum Gupta

@anshumgupta

a year ago

That picture says a lot! #communityovercode

thumb_up_off_alt0

chat_bubble_outline0

repeat0

shareShare

Charlie Hull

@flaxsearch

a year ago

If you want great AI, talk to a search person.

thumb_up_off_alt16

chat_bubble_outline0

repeat5

shareShare

Adrien Grand

@jpountz

a year ago

"Introduction to Information Retrieval " never gets old. (#53) nlp.stanford.edu/IR-book/inform…

thumb_up_off_alt23

chat_bubble_outline0

repeat7

shareShare

Scott Hanselman 🌮

@shanselman

a year ago

Here’s the thing folks. I’ve been coding 32 years. When something like this happens it’s an organizational failure. Yes, some human wrote a bad line. Someone can “git blame” and point to a human and it’s awful. But it’s the testing, the Cl/CD, the A/B testing, the metered

thumb_up_off_alt7,7K

chat_bubble_outline206

repeat1,1K

shareShare

Michael McCandless

@mikemccand

a year ago

Thank you Vigya Sharma for improving #Lucene's benchmark tooling specifically to make testing Lucene's KNN search a bit easier! Benchmark tooling doesn't get enough open source love when it is arguably more important than the software it is testing. It is a compass that helps

thumb_up_off_alt14

chat_bubble_outline1

repeat1

shareShare

Adrien Grand

@jpountz

a year ago

A one-line removal recently improved performance for one query in Lucene's nightly benchmarks by ~2x people.apache.org/~mikemccand/lu… (annotation GP) github.com/apache/lucene/… This is because this line prevented dynamic pruning from using impacts on the lowest skip level on high-freq terms.

thumb_up_off_alt31

chat_bubble_outline1

repeat4

shareShare

Adrien Grand

@jpountz

a year ago

Lucene just changed the way it stores skip data, from up to 10 levels and stored separately, to at most 2 levels and inlined in postings. This simplification helped speed up queries that botttleneck on advancing by small intervals, see annotation GS at people.apache.org/~mikemccand/lu….

thumb_up_off_alt37

chat_bubble_outline2

repeat4

shareShare

Adrien Grand

@jpountz

a year ago

Lucene 10 was released earlier today. The main release highlight is improved hardware efficiency, I wrote about it on the Elastic blog: elastic.co/search-labs/bl…

thumb_up_off_alt87

chat_bubble_outline2

repeat23

shareShare

Anshum Gupta

Charlie Hull

Anshum Gupta

holden karau

Nick Knize 🌐

Adrien Grand

Apache - The ASF

ShaneCurcuru everywhere on socials 🗳️💙

Anshum Gupta

Anshum Gupta

Anshum Gupta

Anshum Gupta

ShaneCurcuru everywhere on socials 🗳️💙

Anshum Gupta

Charlie Hull

Adrien Grand

Scott Hanselman 🌮

Michael McCandless

Adrien Grand

Adrien Grand

Adrien Grand