Nils Reimers (@nils_reimers) 's Twitter Profile
Nils Reimers

@nils_reimers

VP AI Search @Cohere | ex-huggingface | Creator of SBERT (sbert.net)

ID: 763665948268265473

linkhttps://www.nils-reimers.de calendar_today11-08-2016 09:18:45

2,2K Tweet

13,13K Followers

461 Following

Aran Komatsuzaki (@arankomatsuzaki) 's Twitter Profile Photo

Cohere presents Aya Vision: Advancing the Frontier of Multilingual Multimodality - Aya-Vision-8B outperforms Qwen-2.5-VL-7B - Aya-Vision-32B outperforms Qwen-2.5-VL-72B

Cohere presents Aya Vision: Advancing the Frontier of Multilingual Multimodality

- Aya-Vision-8B outperforms Qwen-2.5-VL-7B
- Aya-Vision-32B outperforms Qwen-2.5-VL-72B
𝚐𝔪𝟾𝚡𝚡𝟾 (@gm8xx8) 's Twitter Profile Photo

Cohere introduces Aya-Vision, a family of 8B and 32B vision-language models for multilingual open-ended interaction. - Synthetic annotation: high-diversity recaptioning + hybrid MT + LLM rephrasing across 23 languages - Cross-modal model merging: training-free, restores 94.1% of

Cohere introduces Aya-Vision, a family of 8B and 32B vision-language models for multilingual open-ended interaction.

- Synthetic annotation: high-diversity recaptioning + hybrid MT + LLM rephrasing across 23 languages
- Cross-modal model merging: training-free, restores 94.1% of
cohere (@cohere) 's Twitter Profile Photo

We’re partnering with Dell to accelerate secure, agentic enterprise AI solutions. Dell will be the first provider to offer our secure agents platform, Cohere North, to enterprises on-premises, which is crucial for regulated industries handling sensitive data 🧵

We’re partnering with <a href="/Dell/">Dell</a> to accelerate secure, agentic enterprise AI solutions.

Dell will be the first provider to offer our secure agents platform, Cohere North, to enterprises on-premises, which is crucial for regulated industries handling sensitive data 🧵
cohere (@cohere) 's Twitter Profile Photo

.DraftWise is transforming how lawyers draft and negotiate contracts! Powered by our Command, Embed, & Rerank models on Microsoft's Microsoft Azure AI Foundry, Draftwise automates the most tedious tasks for lawyers & delivers accurate information, even for complex use cases.

Subbarao Kambhampati (కంభంపాటి సుబ్బారావు) (@rao2z) 's Twitter Profile Photo

Do Intermediate Tokens Produced by LRMs (need to) have any semantics? Our new study "Beyond Semantics: The Unreasonable Effectiveness of Reasonless Intermediate Tokens" lead by kstechly, Karthik Valmeekam Atharva & Vardhan Palod dives into this question 🧵 1/

Do Intermediate Tokens Produced by LRMs (need to) have any semantics? Our new study "Beyond Semantics: The Unreasonable Effectiveness of Reasonless Intermediate Tokens"  lead by <a href="/kayastechly/">kstechly</a>, <a href="/karthikv792/">Karthik Valmeekam</a> <a href="/_gundawar/">Atharva</a> &amp; <a href="/PalodVardh12428/">Vardhan Palod</a> dives into this question 🧵 1/
BetaKit (@betakit) 's Twitter Profile Photo

.cohere CEO claims #AI startup “not far away” from profitability as it snags Dell and SAP partnerships. #CDNtech betakit.com/cohere-ceo-cla…

Michael Black (@michael_j_black) 's Twitter Profile Photo

If you're an international PhD student at Harvard studying computer vision and your visa is cancelled, reach out to me or others in Europe. Don't despair. I'm sure we can find you a great place to carry on your research.

Maxime Labonne (@maximelabonne) 's Twitter Profile Photo

The French Ministry of Culture released 175k high-quality arena-style preferences It's exactly the type of data LMSYS stopped releasing. They created their own chatbot arena with 55 models and open-sourced everything. Incredible work! 🤗 Dataset: huggingface.co/datasets/minis…

The French Ministry of Culture released 175k high-quality arena-style preferences

It's exactly the type of data LMSYS stopped releasing.

They created their own chatbot arena with 55 models and open-sourced everything. Incredible work!

🤗 Dataset: huggingface.co/datasets/minis…
Stella Li (@stellalisy) 's Twitter Profile Photo

🤯 We cracked RLVR with... Random Rewards?! Training Qwen2.5-Math-7B with our Spurious Rewards improved MATH-500 by: - Random rewards: +21% - Incorrect rewards: +25% - (FYI) Ground-truth rewards: + 28.8% How could this even work⁉️ Here's why: 🧵 Blogpost: tinyurl.com/spurious-rewar…

🤯 We cracked RLVR with... Random Rewards?!
Training Qwen2.5-Math-7B with our Spurious Rewards improved MATH-500 by:
- Random rewards: +21%
- Incorrect rewards: +25%
- (FYI) Ground-truth rewards: + 28.8%
How could this even work⁉️ Here's why: 🧵
Blogpost: tinyurl.com/spurious-rewar…
Nathan Lambert (@natolambert) 's Twitter Profile Photo

All the american companies hiding reasoning traces for their models is making the research ecosystem default all of its efforts on using DeepSeek as the "frontier reasoning model" to study. Would be very easy to fix. Gemini's change to remove them is sad and doesn't make sense.

kepano (@kepano) 's Twitter Profile Photo

OpenAI is now required by court order to preserve all ChatGPT logs including "temporary chats" and API requests that would have been deleted if I understand this correctly, it means data retention policies for apps that use OpenAI API simply cannot be honored

OpenAI is now required by court order to preserve all ChatGPT logs including "temporary chats" and API requests that would have been deleted 

if I understand this correctly, it means data retention policies for apps that use OpenAI API simply cannot be honored
SemiAnalysis (@semianalysis_) 's Twitter Profile Photo

Every year, Jensen drops new math rules that adds confusion to the whole industry. This year is no different, we now observe a third new Jensen math rule.  The first Jensen math rule is that NVIDIA headline FLOPs are quoted with 2:4 sparsity (which no one uses out of China)

Nils Reimers (@nils_reimers) 's Twitter Profile Photo

Exciting to partner with Ensemble, who manage $40 billion in annual net patient revenue, to bring secure AI agents into the healthcare industry.