felfri (@felix_friedri) Twitter Tweets • TwiCopy

felfri

@felix_friedri

+ Follow

PhD @ TU Darmstadt, hessian.AI

ID: 1629059742877339648

calendar_today24-02-2023 10:05:21

4 Tweet

29 Followers

4 Following

felfri

@felix_friedri

2 years ago

#keepAIopen

thumb_up_off_alt5

chat_bubble_outline0

repeat1

shareShare

I'm thrilled to announce LlavaGuard, a family of VLM-based safeguard models that offers a versatile framework for evaluating the safety compliance of visual content. ml-research.github.io/human-centered…

thumb_up_off_alt49

chat_bubble_outline1

repeat13

shareShare

Usman Gohar

@usmangohar

a year ago

🚨Excited to share that the V2 of the social impact paper, led by the incredible Irene Solaiman and Zeerak (زیرک ظلعت) is on Zeerak@ mastodon|bsky, is finally out! In this work, we present a guide for evaluating the social impact of Gen AI systems across categories & modalities. 🧵(1/7) arxiv.org/pdf/2306.05949

thumb_up_off_alt29

chat_bubble_outline6

repeat8

shareShare

Paul Röttger

@paul_rottger

9 months ago

Today, we are releasing MSTS, a new Multimodal Safety Test Suite for vision-language models! MSTS is exciting because it tests for safety risks *created by multimodality*. Each prompt consists of a text + image that *only in combination* reveal their full unsafe meaning. 🧵

thumb_up_off_alt48

chat_bubble_outline1

repeat20

shareShare

felfri

felfri

Manuel Brack

Usman Gohar

Paul Röttger