felfri (@felix_friedri)'s Twitter Profile
PhD @ TU Darmstadt, hessian.AI

ID: 1629059742877339648

Joined: 24-02-2023 10:05:21

4 Tweets

29 Followers

4 Following

Manuel Brack (@mbrack_aiml)

I'm thrilled to announce LlavaGuard, a family of VLM-based safeguard models that offers a versatile framework for evaluating the safety compliance of visual content. ml-research.github.io/human-centered…

Usman Gohar (@usmangohar)

🚨Excited to share that the V2 of the social impact paper, led by the incredible Irene Solaiman and Zeerak (زیرک ظلعت), is finally out! In this work, we present a guide for evaluating the social impact of Gen AI systems across categories & modalities. 🧵(1/7) arxiv.org/pdf/2306.05949

Paul Röttger (@paul_rottger)

Today, we are releasing MSTS, a new Multimodal Safety Test Suite for vision-language models! MSTS is exciting because it tests for safety risks *created by multimodality*. Each prompt consists of a text + image that *only in combination* reveal their full unsafe meaning. 🧵
