Chris Mauck (@cmauck10) Twitter Tweets • TwiCopy

KonfHub

2 years ago

Dive into the fascinating world of AI with the #DataStories #BloggingContest by Cleanlab ! 🤖 Check out the coming week's spotlight theme '#LargeLanguageModels'. Share your insights, submit your blogs, and stand a chance to win exciting prizes! 🏆 📝datastories.konfhub.com

Dive into the fascinating world of AI with the #DataStories #BloggingContest by <a href="/CleanlabAI/">Cleanlab</a> ! 🤖

Check out the coming week's spotlight theme '#LargeLanguageModels'. Share your insights, submit your blogs, and stand a chance to win exciting prizes! 🏆

📝datastories.konfhub.com

thumb_up_off_alt2

chat_bubble_outline0

repeat1

shareShare

Santiago

@svpino

2 years ago

An open-source 10x library for anyone working with data: github.com/cleanlab/clean… The team at Cleanlab just finished a presentation for the ml.school community. Their tool is mind-blowing. With a single line of Python code, it will: • Detect common data

An open-source 10x library for anyone working with data:

github.com/cleanlab/clean…

The team at <a href="/CleanlabAI/">Cleanlab</a> just finished a presentation for the ml.school community.

Their tool is mind-blowing. With a single line of Python code, it will:

• Detect common data

thumb_up_off_alt484

chat_bubble_outline6

repeat86

shareShare

Akshay 🚀

@akshay_pachaar

2 years ago

Seven Must-Follow Accounts for anyone in the Python, AI/ML space: - Cutting-edge AI research: Sebastian Raschka - Top ML content: Santiago - Python: Mike Driscoll - SOTA MLOps/LLMOps: @AbacusAI - Learn & Build with AI: Lightning AI ⚡️ - Data-centric AI at it's finest: Cleanlab - AI

thumb_up_off_alt209

chat_bubble_outline13

repeat43

shareShare

Curtis G. Northcutt

@cgnorthcutt

2 years ago

Cleanlab made the Forbes AI 50 list! It is an honor to be recognized alongside friends and inspirational companies like OpenAI, Databricks, and Hugging Face. We're just getting started... forbes.com/lists/ai50

thumb_up_off_alt17

chat_bubble_outline0

repeat3

shareShare

Chris Mauck

@cmauck10

2 years ago

Come check out the brand new Trustworthy Language Model --- adding trust and reliability to every LLM! The playground is so cool...

thumb_up_off_alt1

chat_bubble_outline0

repeat0

shareShare

Cleanlab

@cleanlabai

a year ago

Product Announcement: Introducing Cleanlab Studio Auto-Labeling Agent! Annotating a dataset? Save time with Auto-Labeling Agent, which suggests new labels with confidence levels - completing your dataset effortlessly. cleanlab.ai/blog/auto-labe…

thumb_up_off_alt14

chat_bubble_outline2

repeat3

shareShare

Cleanlab

@cleanlabai

a year ago

Struggling to deploy RAG into production? Our newest video shows how to: - Ensure documents are free of: (near) duplicates, PII, low-quality or non-English text,... - Add smart metadata to improve Retrieval - Gauge the trustworthiness of each LLM answer youtube.com/watch?v=xpDidd…

thumb_up_off_alt7

chat_bubble_outline1

repeat2

shareShare

Cleanlab

@cleanlabai

a year ago

How many "r" in strawberry?? Today we're excited to announce a new way to catch and explain hallucinations from any LLM! It’s been over a year since the release of GPT-4, but these models remain fundamentally unreliable and risky to use in high-stakes applications. The

thumb_up_off_alt12

chat_bubble_outline1

repeat3

shareShare

Chris Mauck

@cmauck10

a year ago

What's more exciting than #RAG? Agentic RAG! Checkout my new blog on using LLM trustworthiness scores to automatically optimize retrieval strategy complexity. pub.towardsai.net/reliable-agent…

thumb_up_off_alt2

chat_bubble_outline0

repeat0

shareShare

Cleanlab

@cleanlabai

a year ago

Introducing Agentic RAG with LLM trustworthiness estimates -- A framework to ensure reliable answers in Retrieval-Augmented Generation and keep latency/costs in check. The idea: Assess response trustworthiness and then adjust retrieval plans to ensure sufficient context [...🧵]

thumb_up_off_alt19

chat_bubble_outline1

repeat2

shareShare

Cleanlab

@cleanlabai

a year ago

Don’t want users to lose trust in your RAG system? Then add automated hallucination detection. A new benchmark across 4 RAG datasets reveals which detector best flags incorrect AI responses (amongst RAGAS, G-eval, DeepEval, TLM, LLM self-evaluation) 👇 cleanlab.ai/blog/rag-tlm-h…

thumb_up_off_alt10

chat_bubble_outline0

repeat1

shareShare

Cleanlab

@cleanlabai

a year ago

Want to reduce the error-rate of responses from OpenAI’s o1 LLM by over 20% and also catch incorrect responses in real-time? Just published: 3 benchmarks demonstrating this can be achieved with the Trustworthy Language Model (TLM) framework [...]

thumb_up_off_alt11

chat_bubble_outline1

repeat4

shareShare

Curtis G. Northcutt

@cgnorthcutt

a year ago

NEWS: Cleanlab + Pinecone set a new standard for trustworthy GenAI/RAG! Our latest: AI that’s accurate, curated, and hallucination-free using Cleanlab’ knowledge curation and Pinecone’s vector search. Reliable responses and trust scoring. Full blog 👇pinecone.io/learn/building…

thumb_up_off_alt9

chat_bubble_outline0

repeat3

shareShare

LangChain

@langchainai

a year ago

🧹 Hallucination detection from Cleanlab 🧪 The new tlm-langchain package augments your LangChain / LangGraph applications with an LLM trustworthiness score. Cleanlab 's Trustworthy Language Model detects incorrect LLM outputs in real-time via state-of-the-art uncertainty

thumb_up_off_alt220

chat_bubble_outline3

repeat42

shareShare

Akshay 🚀

@akshay_pachaar

10 months ago

Let's build a trustworthy RAG app that provides a confidence score for each response:

thumb_up_off_alt443

chat_bubble_outline11

repeat42

shareShare

arize-phoenix

@arizephoenix

9 months ago

Better LLMs start with better data and observability We’ve integrated Cleanlab's Trustworthy Language Model (TLM) with Phoenix to help teams improve LLM reliability and performance 🔍 TLM automatically identifies mislabeled, low-quality, or ambiguous training data—ensuring

Better LLMs start with better data and observability

We’ve integrated <a href="/CleanlabAI/">Cleanlab</a>'s Trustworthy Language Model (TLM) with Phoenix to help teams improve LLM reliability and performance

🔍 TLM automatically identifies mislabeled, low-quality, or ambiguous training data—ensuring

thumb_up_off_alt15

chat_bubble_outline2

repeat6

shareShare

MLflow

@mlflow

8 months ago

🚀 New on the MLflow blog: Automatically find the bad LLM responses in your LLM Evals with Cleanlab! Cleanlab’s Trustworthy Language Models (TLM) analyze prompts and responses to calculate a 𝚝𝚛𝚞𝚜𝚝𝚠𝚘𝚛𝚝𝚑𝚒𝚗𝚎𝚜𝚜_𝚜𝚌𝚘𝚛𝚎 — flagging potentially incorrect or

🚀 New on the MLflow blog: Automatically find the bad LLM responses in your LLM Evals with <a href="/CleanlabAI/">Cleanlab</a>!

Cleanlab’s Trustworthy Language Models (TLM) analyze prompts and responses to calculate a 𝚝𝚛𝚞𝚜𝚝𝚠𝚘𝚛𝚝𝚑𝚒𝚗𝚎𝚜𝚜_𝚜𝚌𝚘𝚛𝚎 — flagging potentially incorrect or

thumb_up_off_alt7

chat_bubble_outline0

repeat4

shareShare

Curtis G. Northcutt

@cgnorthcutt

8 months ago

Tomorrow I'm spilling the secrets as to how several Fortune 500 @cleanlabai customers are solving the hardest problem in AI -- producing accurate, compliant, safe fully automated AI Agent responses -- at the AI User Group Conference in SF. Stop by and get your hands dirty and

thumb_up_off_alt6

chat_bubble_outline0

repeat3

shareShare

Cleanlab

@cleanlabai

8 months ago

Cleanlab now works with MLflow — making it easier to detect bad LLM responses right in your pipeline. Faster review cycles. Less manual work. We’re joining MLflow’s upcoming meetup to show how it works. 📅 Attend: lu.ma/mlflow423 📝 Blog: mlflow.org/blog/tlm-traci…

Cleanlab now works with <a href="/MLflow/">MLflow</a> — making it easier to detect bad LLM responses right in your pipeline.

Faster review cycles. Less manual work.

We’re joining MLflow’s upcoming meetup to show how it works.

📅 Attend: lu.ma/mlflow423

📝 Blog: mlflow.org/blog/tlm-traci…

thumb_up_off_alt4

chat_bubble_outline0

repeat1

shareShare

Cleanlab

@cleanlabai

7 months ago

New: Langtrace.ai now includes native support for Cleanlab! Log trust scores, explanations, and metadata for every LLM response—automatically. Instantly surface risky or low-quality outputs. 📝 Blog: langtrace.ai/blog/langtrace… 💻 Docs: docs.langtrace.ai/supported-inte…

New: <a href="/langtrace_ai/">Langtrace.ai</a> now includes native support for Cleanlab!

Log trust scores, explanations, and metadata for every LLM response—automatically. Instantly surface risky or low-quality outputs.

📝 Blog: langtrace.ai/blog/langtrace…

💻 Docs: docs.langtrace.ai/supported-inte…

thumb_up_off_alt10

chat_bubble_outline0

repeat4

shareShare