Prasanna Sattigeri (@prasatti)'s Twitter Profile
Prasanna Sattigeri

@prasatti

Principal Research Scientist @IBMResearch and @MITIBMLab.

ID: 42828416

Link: https://prasannasattigeri.com/
Joined: 27-05-2009 06:11:34

535 Tweets

503 Followers

1.1K Following

elvis (@omarsar0)

IBM open-sources Granite Guardian, a suite of safeguards for risk detection in LLMs.

The authors claim that "With AUC scores of 0.871 and 0.854 on harmful content and RAG-hallucination-related benchmarks respectively, Granite Guardian is the most generalizable and competitive

Kush Varshney कुश वार्ष्णेय (@krvarshney)

Look at those beautiful Granite Guardian safety vests! #brand #bootleg

The Granite Guardian technical report is now on arXiv: arxiv.org/abs/2412.07724

Give it a read to see how the model is state-of-the-art in detecting harmful or hallucinated prompts and responses.

Kalyan KS (@kalyan_kpl)

Granite Guardian (Safeguard LLMs)

This paper introduces the Granite Guardian models, a suite of safeguard LLMs. 

Granite Guardian models are trained on a unique dataset combining human annotations from diverse sources and synthetic data.  

These safeguard LLMs provide risk

Kalyan KS (@kalyan_kpl)

Top LLM Papers of the Week (December Week 2, 2024)

[1] EXAONE 3.5 (Open LLMs for Real-world use cases)
[2] Granite Guardian (Open Safeguard LLMs)
[3] Asynchronous LLM Function Calling
[4] Efficient Long-Context LLM Inference for Mid-Range GPUs
[5] LLM-based Evaluation Methods

Marktechpost AI Research News ⚡ (@marktechpost)

IBM Open-Sources Granite Guardian: A Suite of Safeguards for Risk Detection in LLMs

IBM has introduced Granite Guardian, an open-source suite of safeguards for risk detection in LLMs. This suite is designed to detect and mitigate multiple risk dimensions. The Granite Guardian

Rohan Paul (@rohanpaul_ai)

A single model that spots everything from toxic prompts to RAG hallucinations in LLM interactions.

Granite Guardian introduces risk detection models for LLMs that can identify harmful content, jailbreaks, and RAG-specific hallucination risks with state-of-the-art accuracy.

DAIR.AI (@dair_ai)

10). Granite Guardian - IBM open-sources Granite Guardian, a suite of safeguards for risk detection in LLMs. x.com/omarsar0/statu…

Valerie Chen (@valeriechen_)

Do benchmark improvements translate into downstream developer productivity? In our latest #TMLR paper, we run a study where people write code with models of varying performance. We find that productivity improvements are not proportional to benchmark gains + vary by task!