
Freddie Vargus
@freddie_v4
CTO & co-founder @quotientai
Research @cohere_labs
—
past: evals @github Copilot, data @quantopian
—
Tico 🇨🇷🇺🇸
ID: 614740050
https://github.com/quotient-ai/judges
21-06-2012 23:20:04
524 Tweets
799 Followers
1.1K Following

Most teams only find out their AI is broken when someone complains or churns. Your agents shouldn’t fail silently. We’re launching Quotient AI Detections: a system to catch agent mistakes, identify how they happened, and automatically fix them.

If you're building AI apps and flying blind, we can help.
→ Sign up: app.quotientai.co
→ Grab $250 in credits with the ElevenLabs AI Engineer Pack: aiengineerpack.com
→ Join our Discord: discord.com/invite/YeJzANp…
Let us help you understand how your agents fail,

We just launched Quotient AI Detections: the first system that helps teams catch AI failures before their users do. As part of the launch, we partnered with ElevenLabs to offer coupons through the AI Engineer Pack:
→ 1,000,000 extra logs
→ 10,000 free detections
→ $250+

HypoEval evaluators (github.com/ChicagoHAI/Hyp…) are now incorporated into judges from Quotient AI — check it out at github.com/quotient-ai/ju…!

detections go brrr
One week in, Quotient AI Detections has processed 20M+ tokens, analyzed tens of thousands of logs, and caught thousands of hallucinations across real AI production apps. Still a long way to go, but we're committed to giving builders SOTA AI monitoring.

headed to SF for AI Engineer World’s Fair! Who wants to meet up? Who should I meet? Will be around until Thursday evening

it was a pleasure speaking at AI Engineer with Maitar Asher 🎗️ from tavily and Deanna Emery from Quotient AI 🫡

retrieval + search track = best vibes
AI Engineer ft. Maitar Asher 🎗️, Deanna Emery, Jerry Liu, tavily, Quotient AI, LlamaIndex 🦙

“You want your model hitting milestones, not minefields.” Most AI eval talk is hand-wavy. This isn’t. Freddie Vargus (Quotient AI CTO) gets into the weeds: how to actually test tool use, avoid minefields, and build agents that don’t break. Check out the recording👇