Henry AI Labs (@labs_henry) 's Twitter Profile
Henry AI Labs

@labs_henry

YouTube channel explaining Deep Learning papers! youtube.com/channel/UCHB9Vโ€ฆ

ID: 1097643557345783808

linkhttp://www.henryailabs.com calendar_today18-02-2019 23:46:32

1,1K Tweet

1,1K Followers

818 Following

Connor Shorten (@cshorten30) 's Twitter Profile Photo

Late Interaction is not only great for inference, but also for training!! ๐Ÿญ Fine-tuning single-vector embedding models hasnโ€™t really taken offโ€ฆ Late Interaction could change this. One of my favorite takeaways from the podcast, here is a clip explaining this further in ~1

Amรฉlie Chatelain (@amelietabatta) 's Twitter Profile Photo

At LightOn, there's a lot of peer pressure to convert into a late interaction enjoyer! But it just makes sense: it's keyword search, evolved. Semantic search lost token-level granularity. Hybrid patches that. Late interaction fixes it natively: same granularity, learned space.

Bob van Luijt (@bobvanluijt) 's Twitter Profile Photo

๐Ÿ™Œ More than 200 AI developers just weighed in via IT Brand Pulse, and Weaviate swept the vector database categories! ๐Ÿฅ‡ ๐— ๐—ฎ๐—ฟ๐—ธ๐—ฒ๐˜ ๐—Ÿ๐—ฒ๐—ฎ๐—ฑ๐—ฒ๐—ฟ ๐Ÿฅ‡ ๐—œ๐—ป๐˜๐—ฒ๐—น๐—น๐—ถ๐—ด๐—ฒ๐—ป๐—ฐ๐—ฒ & ๐—œ๐—ป๐—ป๐—ผ๐˜ƒ๐—ฎ๐˜๐—ถ๐—ผ๐—ป ๐—Ÿ๐—ฒ๐—ฎ๐—ฑ๐—ฒ๐—ฟ ๐Ÿ”— Read the full breakdown here: itbrandpulse.com/wp-content/uplโ€ฆ

๐Ÿ™Œ More than 200 AI developers just weighed in via IT Brand Pulse, and Weaviate swept the vector database categories!

๐Ÿฅ‡ ๐— ๐—ฎ๐—ฟ๐—ธ๐—ฒ๐˜ ๐—Ÿ๐—ฒ๐—ฎ๐—ฑ๐—ฒ๐—ฟ
๐Ÿฅ‡ ๐—œ๐—ป๐˜๐—ฒ๐—น๐—น๐—ถ๐—ด๐—ฒ๐—ป๐—ฐ๐—ฒ & ๐—œ๐—ป๐—ป๐—ผ๐˜ƒ๐—ฎ๐˜๐—ถ๐—ผ๐—ป ๐—Ÿ๐—ฒ๐—ฎ๐—ฑ๐—ฒ๐—ฟ

๐Ÿ”— Read the full breakdown here: itbrandpulse.com/wp-content/uplโ€ฆ
Victoria Slocum (@victorialslocum) 's Twitter Profile Photo

If you're building a PDF RAG pipeline: Should you be using OCR and ๐˜๐—ฒ๐˜…๐˜-๐—ฏ๐—ฎ๐˜€๐—ฒ๐—ฑ ๐—ฟ๐—ฒ๐˜๐—ฟ๐—ถ๐—ฒ๐˜ƒ๐—ฎ๐—น methods, or just ๐—ฒ๐—บ๐—ฏ๐—ฒ๐—ฑ ๐—ถ๐—บ๐—ฎ๐—ด๐—ฒ๐˜€ ๐—ฑ๐—ถ๐—ฟ๐—ฒ๐—ฐ๐˜๐—น๐˜† using late interaction models? This paper says the answer might actually be ๐˜ฃ๐˜ฐ๐˜ต๐˜ฉ. My colleagues at Weaviate

Connor Shorten (@cshorten30) 's Twitter Profile Photo

What is the state of Cross Encoder API latency? ๐Ÿค” I just ran a quick latency test for Cohere and Voyage's `rerank` APIs. โš™๏ธ The test sweeps across reranking K = 10, 20, 50, 100, 200, and 500 documents. Each document has an average of ~1,000 tokens. Here are the fastest scores

What is the state of Cross Encoder API latency? ๐Ÿค”

I just ran a quick latency test for Cohere and Voyage's `rerank` APIs. โš™๏ธ

The test sweeps across reranking K = 10, 20, 50, 100, 200, and 500 documents. Each document has an average of ~1,000 tokens.

Here are the fastest scores
Antoine Chaffin (@antoine_chaffin) 's Twitter Profile Photo

Until we have multimodal search at the same level of text search, this is probably the best way of having a searchable format of our discussions! Very thorough summary and very happy to have it to share and go back to!

Connor Shorten (@cshorten30) 's Twitter Profile Photo

I learned a lot from our discussion of Reason-ModernColBERT and Reasoning-Intensive Retrieval ๐Ÿง  Firstly, check out the ReasonIR dataset from Meta if you haven't already! This is an incredible resource for training search models! ๐Ÿ› ๏ธ Secondly, there are two things going on with

I learned a lot from our discussion of Reason-ModernColBERT and Reasoning-Intensive Retrieval ๐Ÿง 

Firstly, check out the ReasonIR dataset from Meta if you haven't already! This is an incredible resource for training search models! ๐Ÿ› ๏ธ

Secondly, there are two things going on with
Antoine Chaffin (@antoine_chaffin) 's Twitter Profile Photo

Fine-tuning Reason-ModernColBERT on the AgentIR data and appending reasoning traces boosted the accuracy on BrowseComp-Plus by 10% with OSS btw Great job Zijian Chen

Connor Shorten (@cshorten30) 's Twitter Profile Photo

Hey everyone! I am SUPER EXCITED to publish a new episode of the Weaviate Podcast with Shreya Shankar (Shreya Shankar) on Data Agents! ๐Ÿ‘พ Shreya is a Ph.D. student in the EPIC Data Lab (UC Berkeley EPIC Lab) advised by Aditya Parameswaran (Aditya Parameswaran) at UC Berkeley. Her research focuses on

Hey everyone! I am SUPER EXCITED to publish a new episode of the Weaviate Podcast with Shreya Shankar (<a href="/sh_reya/">Shreya Shankar</a>) on Data Agents! ๐Ÿ‘พ

Shreya is a Ph.D. student in the EPIC Data Lab (<a href="/UCBEPIC/">UC Berkeley EPIC Lab</a>) advised by Aditya Parameswaran (<a href="/adityagp/">Aditya Parameswaran</a>) at UC Berkeley. Her research focuses on
Weaviate Podcast (@weaviatepodcast) 's Twitter Profile Photo

The Data Agent Benchmark measures how well AI Agents can handle complex queries across Multiple Database Systems! ๐ŸŽฏ๐Ÿ‘พ

Weaviate Podcast (@weaviatepodcast) 's Twitter Profile Photo

What if you could filter the objects in a database with natural language commands, rather than relying on pre-computed columns and attributes? โœจ This is the idea behind Semantic Operators, also known as AI SQL. ๐Ÿ‘พ This clip explains this idea furher, one of the most

โ„ฮตsam (@hesamation) 's Twitter Profile Photo

Dear recruiters, if you are writing a job posting for AI Engineering, here is how long each tool has been available, so you don't make a fool of yourself: TensorFlow: 17 years MCP: 6 years vLLM: 7 years Ollama: 10 years CrewAI: 12 years CUDA: 25 years JAX: 11 years Weaviate: 14