Wenjie Jacky Mo (@wenjie_jacky_mo) 's Twitter Profile
Wenjie Jacky Mo

@wenjie_jacky_mo

Ph.D. student @ UC Davis🚲🐄🥚
Prev. Undergrad @ USC 🔴🟡✌️
Interest. NLP; AI Safety 🤖️

ID: 1854215734865571840

linkhttps://jacky-mo-1111.github.io/ calendar_today06-11-2024 17:34:35

7 Tweet

2 Followers

17 Following

🌴Muhao Chen🌴 (@muhao_chen) 's Twitter Profile Photo

🚨 Call for Papers! ACL 2025 🚨 LLM Security Workshop @ ACL 2025 (the first workshop of ACL SIGSEC) 🔐 Topics: Adversarial attacks, defenses, vulnerabilities, ethical & legal aspects, safe deployment of LLMs and more 📅 Submission Deadline: April 15, 2025 📍 August 1, 2025 in

Xiaofei Wen (@xiaofei_wen_mk) 's Twitter Profile Photo

Can LLM guardrails think twice before deciding? ✨ Check out our #ACL2025 paper: THINKGUARD — a critique-augmented safety guardrail! ✅ Structured critiques ✅ Interpretable decisions ✅ Robust against adversarial prompts 📑 arxiv.org/abs/2502.13458 🧵[1/n]

Can LLM guardrails think twice before deciding?

✨ Check out our #ACL2025 paper: THINKGUARD — a critique-augmented safety guardrail!
✅ Structured critiques
✅ Interpretable decisions
✅ Robust against adversarial prompts

📑 arxiv.org/abs/2502.13458
🧵[1/n]
Tinghui Zhu (@darthzhu_) 's Twitter Profile Photo

😴 Extending modality based on an LLM has been a common practice when we are talking about multimodal LLMs. ❓ Can it generalize to omni-modality? We study the effects of extending modality and ask three questions: arxiv.org/abs/2506.01872 #LLM #MLLM #OmniModality

Qin Liu (@qinliu_nlp) 's Twitter Profile Photo

🚨 New paper accepted to #ACL2025! We propose SudoLM, a framework that lets LLMs learn access control over parametric knowledge. Rather than blocking everyone from sensitive knowledge, SudoLM grants access to authorized users only. Paper: arxiv.org/abs/2410.14676… 🧵[1/6]👇

🚨 New paper accepted to #ACL2025!
We propose SudoLM, a framework that lets LLMs learn access control over parametric knowledge.
Rather than blocking everyone from sensitive knowledge, SudoLM grants access to authorized users only.
Paper: arxiv.org/abs/2410.14676…
🧵[1/6]👇