Minish (@minishlab) 's Twitter Profile
Minish

@minishlab

Building Model2Vec, SemHash, and Vicinity. Check out our GitHub here: github.com/MinishLab. We are also on HuggingFace: huggingface.co/minishlab

ID: 1884941928183046144

linkhttps://minish.ai/ calendar_today30-01-2025 12:29:19

52 Tweet

88 Takipçi

13 Takip Edilen

Minish (@minishlab) 's Twitter Profile Photo

New model: potion-multilingual-128M, a state-of-the-art static multilingual embedder! 🔥 It supports 101 languages and reaches 90.8% of the performance of LaBSE. Model on HuggingFace: huggingface.co/minishlab/poti… Results: github.com/MinishLab/mode…

New model: potion-multilingual-128M, a state-of-the-art static multilingual embedder! 🔥

It supports 101 languages and reaches 90.8% of the performance of LaBSE.

Model on HuggingFace: huggingface.co/minishlab/poti…

Results: github.com/MinishLab/mode…
Sonam 🚀 (@sonam_pankaj_) 's Twitter Profile Photo

📢 𝗔𝗻𝗻𝗼𝘂𝗻𝗰𝗲𝗺𝗲𝗻𝘁 #𝘦𝘮𝘣𝘦𝘥𝘢𝘯𝘺𝘵𝘩𝘪𝘯𝘨🦀𝘷.0.6 𝘪𝘴 𝘰𝘶𝘵, 𝘸𝘪𝘵𝘩 𝘵𝘩𝘦 𝘭𝘢𝘵𝘦𝘴𝘵 𝘮𝘰𝘥𝘦𝘭𝘴 cohere's 𝘦𝘮𝘣𝘦𝘥4, 𝘢𝘯𝘥 𝘮𝘰𝘥𝘦𝘭2𝘷𝘦𝘤 𝘣𝘺 Minish , 𝘢 𝘯𝘦𝘸 𝘤𝘩𝘶𝘯𝘬𝘪𝘯𝘨 𝘮𝘦𝘵𝘩𝘰𝘥, 𝘓𝘢𝘵𝘦-𝘤𝘩𝘶𝘯𝘬𝘪𝘯𝘨 𝘣𝘺 @Weaviate. RAG,

Ben Burtenshaw (@ben_burtenshaw) 's Twitter Profile Photo

Do not sleep on deduplication! Use this FREE app for semantic deduplication of multiple massive datasets. This is how it works: - You pick one all more datasets from the Hub - It make a semantic embedding of each row - It remove removes near duplicates based on a threshold like

Do not sleep on deduplication! Use this FREE app for semantic deduplication of multiple massive datasets.

This is how it works:

- You pick one all more datasets from the Hub
- It make a semantic embedding of each row
- It remove removes near duplicates based on a threshold like
tomaarsen (@tomaarsen) 's Twitter Profile Photo

The deduplication Space by Minish just got a fresh update, allowing you to remove near duplicates in (training) datasets. Details in 🧵

The deduplication Space by <a href="/minishlab/">Minish</a> just got a fresh update, allowing you to remove near duplicates in (training) datasets. 

Details in 🧵
slm tokens (@tulkenss) 's Twitter Profile Photo

Some guy forked our "model2vec-rs" crate, and put it under the "model2vec" name on crates io and then didn't tell us about it. See here: crates.io/crates/model2v… Like what's the goal here except name squatting.

Minish (@minishlab) 's Twitter Profile Photo

We have a new website (and name): minish.ai We’ve been working on an improved website for a while, and it’s finally here. It has documentation for all our packages as well as our blog. More things coming soon! 🚀