Qin Liu (@qinliu_nlp) Twitter Tweets • TwiCopy

Qin Liu

a year ago

🌟 Check out our latest comprehensive survey on: 🌟 ⚠️Emergent backdoor threats to LLMs 👻Safety challenges to LLMs 💡Future research directions in this area Invited paper at 60th Annual Allerton Conference: ieeexplore.ieee.org/abstract/docum…

thumb_up_off_alt7

chat_bubble_outline0

repeat3

shareShare

Bowen Jin

@bowenjin13

10 months ago

🚀 Introducing 𝗦𝗲𝗮𝗿𝗰𝗵-𝗥𝟭 – the first 𝗿𝗲𝗽𝗿𝗼𝗱𝘂𝗰𝘁𝗶𝗼𝗻 𝗼𝗳 𝗗𝗲𝗲𝗽𝘀𝗲𝗲𝗸-𝗥𝟭 (𝘇𝗲𝗿𝗼) for training reasoning and search-augmented LLM agents with reinforcement learning! This is a step towards training an 𝗼𝗽𝗲𝗻-𝘀𝗼𝘂𝗿𝗰𝗲 𝗢𝗽𝗲𝗻𝗔𝗜 “𝗗𝗲𝗲𝗽

thumb_up_off_alt2,2K

chat_bubble_outline45

repeat326

shareShare

Sheng Zhang

@sheng_zh

9 months ago

🚀 Excited to share MetaScale, our latest work advancing LLM reasoning capabilities! MetaScale empowers GPT-4o to match or even surpass frontier reasoning models like o1, Claude-3.5-Sonnet, and o1-mini on the challenging Arena-Hard benchmark (lmarena.ai). Additionally, MetaScale

thumb_up_off_alt105

chat_bubble_outline0

repeat28

shareShare

Wenjie Jacky Mo

@wenjie_jacky_mo

9 months ago

Worried about backdoors in LLMs? 🌟 Check out our #NAACL2025 work on test-time backdoor mitigation! ✅ Black-box 📦 ✅ Plug-and-play 🛡️ We explore: → Defensive Demonstrations 🧪 → Self-generated Prefixes 🧩 → Self-refinement ✍️ 📄 arxiv.org/abs/2311.09763 🧵[1/n]

thumb_up_off_alt8

chat_bubble_outline1

repeat6

shareShare

🌴Muhao Chen🌴

@muhao_chen

8 months ago

🚨 Call for Papers! ACL 2025 🚨 LLM Security Workshop @ ACL 2025 (the first workshop of ACL SIGSEC) 🔐 Topics: Adversarial attacks, defenses, vulnerabilities, ethical & legal aspects, safe deployment of LLMs and more 📅 Submission Deadline: April 15, 2025 📍 August 1, 2025 in

thumb_up_off_alt25

chat_bubble_outline0

repeat14

shareShare

Fei Wang

@fwang_nlp

8 months ago

🎉 Excited to share that our paper, "MuirBench: A Comprehensive Benchmark for Robust Multi-image Understanding", will be presented at #ICLR2025! 📅 Date: April 24 🕒 Time: 3:00 PM 📍 Location: Hall 3 + Hall 2B #11 MuirBench challenges multimodal LLMs with diverse multi-image

thumb_up_off_alt53

chat_bubble_outline0

repeat17

shareShare

Hadi Askari

@hadiaskari67

8 months ago

🧵1/ Excited to share our #NAACL2025 work! 🎉 "Assessing LLMs for Zero-Shot Abstractive Summarization Through the Lens of Relevance Paraphrasing" We study how robust LLM summarization is to our relevance paraphrasing method? 🧠📝 More details below:👇 arxiv.org/abs/2406.03993

thumb_up_off_alt15

chat_bubble_outline1

repeat7

shareShare

Xiaofei Wen

@xiaofei_wen_mk

7 months ago

Can LLM guardrails think twice before deciding? ✨ Check out our #ACL2025 paper: THINKGUARD — a critique-augmented safety guardrail! ✅ Structured critiques ✅ Interpretable decisions ✅ Robust against adversarial prompts 📑 arxiv.org/abs/2502.13458 🧵[1/n]

thumb_up_off_alt12

chat_bubble_outline1

repeat10

shareShare

Tinghui Zhu

@darthzhu_

6 months ago

😴 Extending modality based on an LLM has been a common practice when we are talking about multimodal LLMs. ❓ Can it generalize to omni-modality? We study the effects of extending modality and ask three questions: arxiv.org/abs/2506.01872 #LLM #MLLM #OmniModality

thumb_up_off_alt12

chat_bubble_outline1

repeat10

shareShare

jakedineenasu

@jakedineenasu

6 months ago

🔍 Introducing QA-LIGN: A reflective alignment approach using a draft→reflection→revision pipeline. We create symbolic reward models that serve as both natural language critics & general reward models, bridging rule-based rewards and RLAIF. 📄 Paper: arxiv.org/pdf/2506.08123

thumb_up_off_alt4

chat_bubble_outline1

repeat6

shareShare

Wenjie Jacky Mo

@wenjie_jacky_mo

5 months ago

ACLRollingReview EMNLP 2025 Urgent help needed. acFZ: initial score 3 🧊 Complete silence during discussion. ⏰ 4am PST, 9 min before deadline: quietly drops to 2. with “Thanks for the rebuttal. I have updated the score.” ⚠️ No explanation. No notice. No chance to respond. (0/n)

<a href="/ReviewAcl/">ACLRollingReview</a> <a href="/emnlpmeeting/">EMNLP 2025</a> Urgent help needed.

acFZ: initial score 3

🧊 Complete silence during discussion.
⏰ 4am PST, 9 min before deadline: quietly drops to 2.
with “Thanks for the rebuttal. I have updated the score.”
⚠️ No explanation. No notice. No chance to respond.
(0/n)

thumb_up_off_alt29

chat_bubble_outline6

repeat3

shareShare

Tenghao Huang

@tenghaohuang45

5 months ago

🎉 Excited to share our ACL 2025 paper: 🤖R2D2: Remembering, Replaying and Dynamic Decision Making with a Reflective Agentic Memory 🧠 📄 Paper: arxiv.org/abs/2501.12485 📍Poster: Hall 4/5, Session 4 Wednesday, July 30 11:00-12:30 🧵👇

thumb_up_off_alt20

chat_bubble_outline1

repeat9

shareShare

Dongwon Jung

@dong_w0n

4 months ago

Excited to share that two of my first-author papers were accepted to #EMNLP2025! ✨📚 1️⃣ Code Execution as Grounded Supervision for LLM Reasoning (Main) 2️⃣ Familiarity-Aware Evidence Compression for Retrieval-Augmented Generation (Findings) Huge thanks to my collaborators🙌

thumb_up_off_alt12

chat_bubble_outline2

repeat6

shareShare

jakedineenasu

@jakedineenasu

4 months ago

Thrilled to share QA-LIGN 𝐚𝐭 #EMNLP2025! Bridging rule-based rewards and LLM-as-a-Judge via LLM-derived symbolic reward rubrics. 🔗 arxiv.org/pdf/2506.08123

thumb_up_off_alt8

chat_bubble_outline0

repeat3

shareShare