Wenjie Jacky Mo (@wenjie_jacky_mo)'s Twitter Profile
Wenjie Jacky Mo

@wenjie_jacky_mo

Ph.D. student @ UC Davis 🚲🐄🥚
Prev. Undergrad @ USC 🔴🟡✌️
Interest. NLP; AI Safety 🤖

ID: 1854215734865571840

Link: https://jacky-mo-1111.github.io/

Joined: 06-11-2024 17:34:35

7 Tweets

2 Followers

17 Following

🌴Muhao Chen🌴 (@muhao_chen) 's Twitter Profile Photo

🚨 Call for Papers! ACL 2025 🚨 LLM Security Workshop @ ACL 2025 (the first workshop of ACL SIGSEC) πŸ” Topics: Adversarial attacks, defenses, vulnerabilities, ethical & legal aspects, safe deployment of LLMs and more πŸ“… Submission Deadline: April 15, 2025 πŸ“ August 1, 2025 in

Xiaofei Wen (@xiaofei_wen_mk)

Can LLM guardrails think twice before deciding? ✨ Check out our #ACL2025 paper: THINKGUARD, a critique-augmented safety guardrail! ✅ Structured critiques ✅ Interpretable decisions ✅ Robust against adversarial prompts 📑 arxiv.org/abs/2502.13458 🧵[1/n]

Tinghui Zhu (@darthzhu_)

😴 Extending modality based on an LLM has been a common practice when we are talking about multimodal LLMs. ❓ Can it generalize to omni-modality? We study the effects of extending modality and ask three questions: arxiv.org/abs/2506.01872 #LLM #MLLM #OmniModality

Qin Liu (@qinliu_nlp)

🚨 New paper accepted to #ACL2025! We propose SudoLM, a framework that lets LLMs learn access control over parametric knowledge. Rather than blocking everyone from sensitive knowledge, SudoLM grants access to authorized users only. Paper: arxiv.org/abs/2410.14676… 🧡[1/6]πŸ‘‡

🚨 New paper accepted to #ACL2025!
We propose SudoLM, a framework that lets LLMs learn access control over parametric knowledge.
Rather than blocking everyone from sensitive knowledge, SudoLM grants access to authorized users only.
Paper: arxiv.org/abs/2410.14676…
🧡[1/6]πŸ‘‡