Noah Ziems (@noahziems) 's Twitter Profile
Noah Ziems

@noahziems

PhD student @NotreDame studying NLP advised by @Meng_CS working on LLM evals. DSPy contributor.

ID: 1362445364205355015

linkhttp://noahziems.com calendar_today18-02-2021 16:54:29

315 Tweet

330 Followers

1,1K Following

Shaokun Zhang (@shaokunzhang1) 's Twitter Profile Photo

(ICML 2025 Spotlight-Top 2.6%) Multi-agent LLM systems still fail—but who caused it, and when? 🔥 We introduce Who&When, the first benchmark for automated failure attribution in LLM multi-agent systems. We also define a new task: automatically identifying which agent failed, and

(ICML 2025 Spotlight-Top 2.6%) Multi-agent LLM systems still fail—but who caused it, and when?

🔥 We introduce Who&When, the first benchmark for automated failure attribution in LLM multi-agent systems. We also define a new task: automatically identifying which agent failed, and
Noah Ziems (@noahziems) 's Twitter Profile Photo

Even just for organizing and orchestrating complex prompt-based systems with no optimization, DSPy is absolutely wonderful to use

Noah Ziems (@noahziems) 's Twitter Profile Photo

Ilya had a really good slide about how “success is guaranteed” if you have: 1. Big model 2. High quality data 3. Lots of (2) And that defined the next few years of ML It is now going to be something like: 1. Big model 2. Lots of diverse inputs 3. High quality reward signal

Ilya had a really good slide about how “success is guaranteed” if you have:

1.  Big model
2. High quality data
3. Lots of (2)

And that defined the next few years of ML

It is now going to be something like:
1.  Big model
2. Lots of diverse inputs
3. High quality reward signal
Noah Ziems (@noahziems) 's Twitter Profile Photo

Moving forward I think RL papers should be reporting results with random rewards as a baseline instead of just the base model

Andrew Ng (@andrewyng) 's Twitter Profile Photo

New short course: DSPy: Build and Optimize Agentic Apps DSPy is a powerful open-source framework for automatically tuning prompts for GenAI applications. In this course, you'll learn to use DSPy, together with MLflow. This is built in partnership with Databricks and taught by

Noah Ziems (@noahziems) 's Twitter Profile Photo

Before LLMs, people were very precise when discussing the inputs/outputs of ML systems and the flexibility of LLMs have allowed us to become a little lazy in this department

DSPy (@dspyoss) 's Twitter Profile Photo

Maxime Rivest 🧙‍♂️🦙 Are you perhaps looking for our own Noah Ziems's Arbor? It can be used with or without DSPy, and it has the perfect OpenAI-compatible client-server architecture that works well for LLM inference. github.com/Ziems/arbor