Prasanna Sattigeri (@prasatti) Twitter Tweets • TwiCopy

Prasanna Sattigeri

@prasatti

Principal Research Scientist @IBMResearch and @MITIBMLab.

ID: 42828416

Website: https://prasannasattigeri.com/

Joined: 27-05-2009 06:11:34

535 Tweets

503 Followers

1.1K Following

elvis

@omarsar0

a year ago

IBM open-sources Granite Guardian, a suite of safeguards for risk detection in LLMs. The authors claim that "With AUC scores of 0.871 and 0.854 on harmful content and RAG-hallucination-related benchmarks respectively, Granite Guardian is the most generalizable and competitive

109 likes, 6 replies, 26 reposts

Kush Varshney कुश वार्ष्णेय

@krvarshney

a year ago

Look at those beautiful Granite Guardian safety vests! #brand #bootleg The Granite Guardian technical report is now on arXiv: arxiv.org/abs/2412.07724 Give it a read to see how the model is state-of-the-art in detecting harmful or hallucinated prompts and responses.

12 likes, 0 replies, 3 reposts

Kalyan KS

@kalyan_kpl

a year ago

Granite Guardian (Safeguard LLMs) This paper introduces the Granite Guardian models, a suite of safeguard LLMs. Granite Guardian models are trained on a unique dataset combining human annotations from diverse sources and synthetic data. These safeguard LLMs provide risk

6 likes, 0 replies, 4 reposts

Kalyan KS

@kalyan_kpl

a year ago

Top LLM Papers of the Week (December Week 2, 2024)
[1] EXAONE 3.5 (Open LLMs for Real-world use cases)
[2] Granite Guardian (Open Safeguard LLMs)
[3] Asynchronous LLM Function Calling
[4] Efficient Long-Context LLM Inference for Mid-Range GPUs
[5] LLM-based Evaluation Methods

6 likes, 0 replies, 4 reposts

Marktechpost AI Research News ⚡

@marktechpost

a year ago

IBM Open-Sources Granite Guardian: A Suite of Safeguards for Risk Detection in LLMs IBM has introduced Granite Guardian, an open-source suite of safeguards for risk detection in LLMs. This suite is designed to detect and mitigate multiple risk dimensions. The Granite Guardian

17 likes, 0 replies, 7 reposts

Rohan Paul

@rohanpaul_ai

a year ago

A single model that spots everything from toxic prompts to RAG hallucinations in LLM interactions. Granite Guardian introduces risk detection models for LLMs that can identify harmful content, jailbreaks, and RAG-specific hallucination risks with state-of-the-art accuracy.

7 likes, 2 replies, 8 reposts

DAIR.AI

@dair_ai

a year ago

10). Granite Guardian - IBM open-sources Granite Guardian, a suite of safeguards for risk detection in LLMs. x.com/omarsar0/statu…

9 likes, 0 replies, 2 reposts

Prasanna Sattigeri

@prasatti

10 months ago

Nice article discussing Deepseek and the evolving landscape for AI players! ibm.com/think/news/dee…

2 likes, 0 replies, 0 reposts

Justine Moore

@venturetwins

10 months ago

I find these "tiny people doing things" videos so calming

1.1K likes, 65 replies, 138 reposts

Valerie Chen

@valeriechen_

8 months ago

Do benchmark improvements translate into downstream developer productivity? In our latest #TMLR paper, we run a study where people write code with models of varying performance. We find that productivity improvements are not proportional to benchmark gains + vary by task!

9 likes, 1 reply, 3 reposts