HoneyHive (@honeyhiveai)'s Twitter Profile
HoneyHive

@honeyhiveai

Modern AI Observability and Evaluation

ID: 1582081835642896398

Link: https://honeyhive.ai

Joined: 17-10-2022 18:51:32

168 Tweets

378 Followers

0 Following

Zilliz (@zilliz_universe)

Many developers are eager to seek tricks to improve the RAG answer quality, but fail to realize that this is a system optimization problem. Changing one component does not guarantee good overall performance unless there is a scientific way to test the end-to-end quality.

Data Council (@datacouncilai)

Dhruv Singh (Co-Founder, CTO HoneyHive) gives agent evals the time they deserve with his talk “Eval Agents: How to Solve Error Cascades in Agents.”

Why do these agentic systems spiral into failure, and how do we catch it early?

Quietly one of the most important talks of the

Qdrant (@qdrant_engine)

🧭 Adaptable Recommendations outside of the Similarity Bubble with Qdrant and HoneyHive.

Discovery is exciting: new ideas, books, songs, food! Yet it’s hard to find a perfect framework for it. Recommendation algorithms

StarTree (@startreedata)

🚀 Observability and AI are converging, reshaping IT operations.

This fusion enhances system insights and AI performance monitoring.

Explore how tools like HoneyHive and Arize's Phoenix are leading this transformation! @honeyhiveai @ArizePhoenix
🔗 Read more ⬇️📖

HoneyHive (@honeyhiveai)

Today we're shipping some major quality-of-life improvements to traces 🎁

🔍 Session Summaries: Unified view of metrics, evals, and feedback across all spans in an agent session. No more jumping between individual spans.

⏱️ Timeline View: Flamegraph visualization to identify

HoneyHive (@honeyhiveai)

Introducing Alerts 🔔

Alerts in HoneyHive give you real-time monitoring over everything that matters in your agent:

✅ Metric drift - Detect quality degradation over time
✅ Cost spikes - Stay within budget thresholds with usage alerts
✅ Guardrail violations - Monitor safety

ScaleUp:AI (@scaleupevent)

Hear from the software leaders driving AI forward on why they’re excited for ScaleUp:AI (just two weeks away!) - and stay tuned to hear what’s next for the future of the industry.

Featuring:
• Kiteworks' CEO Jonathan Yaron
• Lightrun's Cofounder and CEO Ilan Peleg
•