Minish (@minishlab) 's Twitter Profile
Minish

@minishlab

Building Model2Vec, SemHash, and Vicinity. Check out our GitHub here: github.com/MinishLab. We are also on HuggingFace: huggingface.co/minishlab

ID: 1884941928183046144

linkhttps://minish.ai/ calendar_today30-01-2025 12:29:19

52 Tweet

88 Followers

13 Following

Minish (@minishlab) 's Twitter Profile Photo

New model: potion-multilingual-128M, a state-of-the-art static multilingual embedder! ๐Ÿ”ฅ It supports 101 languages and reaches 90.8% of the performance of LaBSE. Model on HuggingFace: huggingface.co/minishlab/potiโ€ฆ Results: github.com/MinishLab/modeโ€ฆ

New model: potion-multilingual-128M, a state-of-the-art static multilingual embedder! ๐Ÿ”ฅ

It supports 101 languages and reaches 90.8% of the performance of LaBSE.

Model on HuggingFace: huggingface.co/minishlab/potiโ€ฆ

Results: github.com/MinishLab/modeโ€ฆ
Sonam ๐Ÿš€ (@sonam_pankaj_) 's Twitter Profile Photo

๐Ÿ“ข ๐—”๐—ป๐—ป๐—ผ๐˜‚๐—ป๐—ฐ๐—ฒ๐—บ๐—ฒ๐—ป๐˜ #๐˜ฆ๐˜ฎ๐˜ฃ๐˜ฆ๐˜ฅ๐˜ข๐˜ฏ๐˜บ๐˜ต๐˜ฉ๐˜ช๐˜ฏ๐˜จ๐Ÿฆ€๐˜ท.0.6 ๐˜ช๐˜ด ๐˜ฐ๐˜ถ๐˜ต, ๐˜ธ๐˜ช๐˜ต๐˜ฉ ๐˜ต๐˜ฉ๐˜ฆ ๐˜ญ๐˜ข๐˜ต๐˜ฆ๐˜ด๐˜ต ๐˜ฎ๐˜ฐ๐˜ฅ๐˜ฆ๐˜ญ๐˜ด cohere's ๐˜ฆ๐˜ฎ๐˜ฃ๐˜ฆ๐˜ฅ4, ๐˜ข๐˜ฏ๐˜ฅ ๐˜ฎ๐˜ฐ๐˜ฅ๐˜ฆ๐˜ญ2๐˜ท๐˜ฆ๐˜ค ๐˜ฃ๐˜บ Minish , ๐˜ข ๐˜ฏ๐˜ฆ๐˜ธ ๐˜ค๐˜ฉ๐˜ถ๐˜ฏ๐˜ฌ๐˜ช๐˜ฏ๐˜จ ๐˜ฎ๐˜ฆ๐˜ต๐˜ฉ๐˜ฐ๐˜ฅ, ๐˜“๐˜ข๐˜ต๐˜ฆ-๐˜ค๐˜ฉ๐˜ถ๐˜ฏ๐˜ฌ๐˜ช๐˜ฏ๐˜จ ๐˜ฃ๐˜บ @Weaviate. RAG,

Ben Burtenshaw (@ben_burtenshaw) 's Twitter Profile Photo

Do not sleep on deduplication! Use this FREE app for semantic deduplication of multiple massive datasets. This is how it works: - You pick one all more datasets from the Hub - It make a semantic embedding of each row - It remove removes near duplicates based on a threshold like

Do not sleep on deduplication! Use this FREE app for semantic deduplication of multiple massive datasets.

This is how it works:

- You pick one all more datasets from the Hub
- It make a semantic embedding of each row
- It remove removes near duplicates based on a threshold like
tomaarsen (@tomaarsen) 's Twitter Profile Photo

The deduplication Space by Minish just got a fresh update, allowing you to remove near duplicates in (training) datasets. Details in ๐Ÿงต

The deduplication Space by <a href="/minishlab/">Minish</a> just got a fresh update, allowing you to remove near duplicates in (training) datasets. 

Details in ๐Ÿงต
slm tokens (@tulkenss) 's Twitter Profile Photo

Some guy forked our "model2vec-rs" crate, and put it under the "model2vec" name on crates io and then didn't tell us about it. See here: crates.io/crates/model2vโ€ฆ Like what's the goal here except name squatting.

Minish (@minishlab) 's Twitter Profile Photo

We have a new website (and name): minish.ai Weโ€™ve been working on an improved website for a while, and itโ€™s finally here. It has documentation for all our packages as well as our blog. More things coming soon! ๐Ÿš€