Connor Shorten (@cshorten30)'s Twitter Profile
Connor Shorten

@cshorten30

AI and Databases @weaviate_io, host of the @weaviatepodcast

ID: 840746981190447105

Link: https://github.com/CShorten
Joined: 12-03-2017 02:11:15

16,16K Tweet

17,17K Followers

15,15K Following

Brendan Hogan (@brendanh0gan)

introducing: picoDeepResearch

multi-turn tool use + soft rewards + self-play + GRPO

You define the arena (report prompts + judging principles)

the model generates reports, uses tools (web search), then competes in round-robin battles judged by an LLM

winner gets the gradient

Sumit (@_reachsumit)

I published Vol. 105 of "Top Information Retrieval Papers of the Week" on Substack.
🔗 recsys.substack.com/p/training-mul…

fly51fly (@fly51fly)

[LG] DataRater: Meta-Learned Dataset Curation
D A. Calian, G Farquhar, I Kemaev, L M. Zintgraf... [Google DeepMind] (2025)
arxiv.org/abs/2505.17895

Victoria Slocum (@victorialslocum)

There are really only 𝟮 𝘄𝗮𝘆𝘀 to level up your AI agent's performance.

You can either:
1️⃣ Choose a better model
2️⃣ Optimize your prompts

That's it. Everything else is just details.

For model selection, I've tested dozens of options, and 𝗚𝗲𝗺𝗶𝗻𝗶 consistently stands out

Maxime Rivest 🧙‍♂️🦙 (@maximerivest)

Thanks to Omar Khattab's and Cyrus's suggestions, we now have a special module for dspy within attachments. No need for .dspy() everywhere 😍

Also, don't be fooled by the simplicity of this code. This code is a fully working LLM program! This program loads power points

Leonard Tang (@leonardtang_)

You don’t need frontier lab resources for frontier lab automated LLM evaluation.

To prove this, we’re open-sourcing j1-nano and j1-micro: two absurdly tiny (600M & 1.7B parameters) but mighty reward models competitive with orders-of-magnitude larger peers.

j1-nano and j1-micro

Connor Shorten (@cshorten30)

Multi-Vector Retrieval methods such as ColBERT / ColPali are super powerful *but* often too costly to store and search! Well... not anymore! Introducing MUVERA! 🚀

I am SUPER EXCITED to publish our newest Weaviate Podcast on MUVERA with Rajesh Jayaram (@Raj_Jayaram_) from

Omar Khattab (@lateinteraction)

If you're in SF in 2 weeks, see you at the @Databricks Data + AI Summit, where we have a long list of sessions on DSPy, including:

■ DSPy 3.0 by Krista Opsahl-Ong & me (Jun 11)
■ DSPy <> MLflow by Chen Qian (Jun 11)
■ Opinionated intro to DSPy by Drew Breunig (Jun 10)

Matei Zaharia (@matei_zaharia)

Apache Spark 4.0 is out with some huge improvements across the board. SQL’s much more powerful, Spark Connect makes it easier to run apps, new languages and more. It’s amazing to see the community still growing fast and releasing over 5000 patches in 4.0. databricks.com/blog/introduci…

dennylee (@dennylee)

#DennysPick 🌟 If you’re building GenAI applications with DSPy and want full observability, seamless debugging, and true MLOps best practices, this session is a must-see at #DataAISummit!

I’m excited to recommend “Streamlining DSPy Development: Track, Debug, and Deploy With

DSPy (@dspyoss)

When people say “multi-agent”, 80% of the time they just want to write a *multi-module* DSPy program, in which case it’s been supported since 2022! 15% of the time they want to give dspy.ReAct `tools` that are themselves agents, in which case it’s been supported since Oct 2023.