Anshum Gupta (@anshumgupta) 's Twitter Profile
Anshum Gupta

@anshumgupta

Committer on Apache Lucene/Solr. Building search @ , Search engines and more... Barça and Messi supporter. Tweets/opinions are mine!

ID: 62562318

linkhttp://www.anshumgupta.net calendar_today03-08-2009 17:03:44

8,8K Tweet

1,1K Followers

594 Following

holden karau (@holdenkarau) 's Twitter Profile Photo

If your an ASF committer definitely check out this event (and then give me a summary after since I have an offsite commitment that overlaps).

Nick Knize 🌐 (@nknize) 's Twitter Profile Photo

More dimensions isn't always better. Just ask yourself, is 10% more signal with increased hallucinations worth a 99% increase in storage cost? #GenAI #vectors #search #lucenia

Adrien Grand (@jpountz) 's Twitter Profile Photo

Lucene's nightly benchmarks got a massive speedup on queries sorted by numeric field last night: people.apache.org/~mikemccand/lu…. This is due to this PR: github.com/apache/lucene/….

Apache - The ASF (@theasf) 's Twitter Profile Photo

Community Over Code EU is kicking off today in Bratislava, Slovakia! For anyone who lives in/around Slovakia who is interested in attending, check out our 1-day pass for locals. eu.communityovercode.org/tickets/ #CommunityOverCode #opensource

Community Over Code EU is kicking off today in Bratislava, Slovakia!

For anyone who lives in/around Slovakia who is interested in attending, check out our 1-day pass for locals. eu.communityovercode.org/tickets/

#CommunityOverCode #opensource
ShaneCurcuru everywhere on socials 🗳️💙 (@shanecurcuru) 's Twitter Profile Photo

#CommunityOverCode keynote: EU legislative changes panel: "Many of these standards depend on working with people, which is always hard" Code is easy. People are hard. Optimize for people.

#CommunityOverCode keynote: EU legislative changes panel: "Many of these standards depend on working with people, which is always hard"

Code is easy.
People are hard.
Optimize for people.
Anshum Gupta (@anshumgupta) 's Twitter Profile Photo

Search track at Community Over Code in Bratislava, atitaarora kicks off the day talking about “Navigating Challenges and Enhancing Performance of LLM based Applications” #communityovercode #lucene #solr #theasf

Search track at Community Over Code in Bratislava, <a href="/atitaarora/">atitaarora</a> kicks off the day talking about “Navigating Challenges and Enhancing Performance of LLM based Applications” #communityovercode #lucene #solr #theasf
Anshum Gupta (@anshumgupta) 's Twitter Profile Photo

In the follow up talk on the Search track, we have Alessandro Benedetti talk about “Hybrid Search with Apache Solr” #communityovercode #theasf #solr #lucene #llm #search

In the follow up talk on the Search track, we have <a href="/AlexBenedetti/">Alessandro Benedetti</a> talk about “Hybrid Search with Apache Solr” #communityovercode #theasf #solr #lucene #llm #search
Anshum Gupta (@anshumgupta) 's Twitter Profile Photo

Next up is Yupeng Fu from Uber talking about their use of Lucene for Vector search at Uber and Uber eats! #communityovercode #lucene #solr #vectorsearch

Next up is Yupeng Fu from Uber talking about their use of Lucene for Vector search at Uber and Uber eats! #communityovercode #lucene #solr #vectorsearch
ShaneCurcuru everywhere on socials 🗳️💙 (@shanecurcuru) 's Twitter Profile Photo

Dirk-Wiǁem van Gulik⁩ explaining the reasons for the CRA: the Grover Shoe Factory explosion is one example. Fast moving industries move fast and break things - which harms society when steam boilers actually blow up and kills people. #CommunityOverCode

⁦<a href="/dirkx/">Dirk-Wiǁem van Gulik</a>⁩ explaining the reasons for the CRA: the Grover Shoe Factory explosion is one example. Fast moving industries move fast and break things - which harms society when steam boilers actually blow up and kills people. #CommunityOverCode
Scott Hanselman 🌮 (@shanselman) 's Twitter Profile Photo

Here’s the thing folks. I’ve been coding 32 years. When something like this happens it’s an organizational failure. Yes, some human wrote a bad line. Someone can “git blame” and point to a human and it’s awful. But it’s the testing, the Cl/CD, the A/B testing, the metered

Michael McCandless (@mikemccand) 's Twitter Profile Photo

Thank you Vigya Sharma for improving #Lucene's benchmark tooling specifically to make testing Lucene's KNN search a bit easier! Benchmark tooling doesn't get enough open source love when it is arguably more important than the software it is testing. It is a compass that helps

Adrien Grand (@jpountz) 's Twitter Profile Photo

A one-line removal recently improved performance for one query in Lucene's nightly benchmarks by ~2x people.apache.org/~mikemccand/lu… (annotation GP) github.com/apache/lucene/… This is because this line prevented dynamic pruning from using impacts on the lowest skip level on high-freq terms.

Adrien Grand (@jpountz) 's Twitter Profile Photo

Lucene just changed the way it stores skip data, from up to 10 levels and stored separately, to at most 2 levels and inlined in postings. This simplification helped speed up queries that botttleneck on advancing by small intervals, see annotation GS at people.apache.org/~mikemccand/lu….

Adrien Grand (@jpountz) 's Twitter Profile Photo

Lucene 10 was released earlier today. The main release highlight is improved hardware efficiency, I wrote about it on the Elastic blog: elastic.co/search-labs/bl…