Yiwei Wang (@wangyiw33973985) Twitter Tweets • TwiCopy

Chaowei Xiao

2 years ago

🚀 Introducing AutoDAN, a method that automatically generates SEMANTICALLY MEANINGFUL #Jailbreak prompts for #redteaming aligned #LLMs. arXiv: arxiv.org/pdf/2310.04451…

Kai-Wei Chang

@kaiwei_chang

a year ago

I'm grateful for the opportunity to serve as SIGDAT officer. Thank you all for the support and ideas contributing to improving our community. 🥰 sigdat.org/organization

Haoyi Qiu

🔥 Unlocking the power of Abstract Meaning Representations, AMRFact generates coherent, factually inconsistent summaries with high error-type coverage to improve the factuality evaluation on abstractive summarization! 📣 Check out our new #NAACL2024 🇲🇽 work: arxiv.org/abs/2311.09521

Violet Peng

@violetnpeng

a year ago

Proud of this work where we show event detection can generalize from one epidemic to another by identifying epidemic-related event types (e.g. symptoms) even if the actual mentions (e.g. of symptoms) are distinctive. So training on COVID, we can generalize to Monkeypox! #NAACL24

Yu Yang

@yuyang_i

a year ago

Excited about training on synthetic data? Different stages of training might need different synthetic data! 🧠💡 Check out our #ICLR2024 paper on Progressive Dataset Distillation (PDD😉) at PS#2 Halle B#9! It tailors synthetic data to each training stage for better performance!

🌴Muhao Chen🌴

@muhao_chen

a year ago

I have to miss #ICLR2024 due to teaching, but I would still like to give a shout-out to our UniversalNER. This is an extremely strong open NER system that provides precise recognition in many domains and for any new entity types (outperforming ChatGPT by 9% on 43 NER datasets across 9

Yiwei Wang

@wangyiw33973985

a year ago

🎳Jailbreak of "Safe" Large Language Models is as simple as a string replacement. 💥Our recent research finds that specifying an output prefix of LLMs will make the jailbreak easy and effective. researchsquare.com/article/rs-438…

Axel Darmouni

@adarmouni

a year ago

Jailbreaking through asking for coherent outputs 🧵📖 Read of the day, day 49: "Frustratingly Easy Jailbreak of Large Language Models via Output Prefix Attacks", by Yiwei Wang et al. from UCLA. Yet another way of breaking through LLMs' defense systems has been found. This idea is

UCLA Computer Science

@uclacomsci

a year ago

Professor Nanyun (Violet) Peng Receives NSF CAREER Award. Read more: cs.ucla.edu/professor-nany…

Byron

@byron52238498

a year ago

🚀Excited to share our latest research on knowledge editing (KE) in large language models! We unveil a novel approach, DeCK, which enhances in-context editing (ICE) by addressing stubborn knowledge that is tough to edit. DeCK boosts ICE performance by up to 219% on MQuAKE! 🚀

Wenxuan Zhou

@wenxuanzhou_96

a year ago

Introducing WPO: Enhancing RLHF with Weighted Preference Optimization 🌟 Our new preference optimization method reweights preference data to simulate on-policy preference optimization using off-policy data, combining efficiency with high performance. ✅ up to 5.6% better than

Fei Wang

@fwang_nlp

a year ago

🌟 𝐌𝐮𝐥𝐭𝐢𝐦𝐨𝐝𝐚𝐥 𝐃𝐏𝐎🌟 🔍 DPO over-prioritizes language-only preference 🚀 Introducing mDPO: optimizes image-conditioned preference 🏆 Best 3B MLLM with reduced hallucination, beats LLaVA 7/13B with DPO Collaboration with Microsoft Research huggingface.co/papers/2406.11…

AIDB

@ai_database

a year ago

An explainer article written by the LLM researchers themselves is now out. ai-data-base.com/archives/72359 It covers a paper by Baolong Bi et al. of the University of Chinese Academy of Sciences on an approach to editing "stubborn knowledge."

Andrew

@andrewmichaelio

a year ago

🚨 NEW OPENAI MODEL: o1 “o1 spends more time thinking before it responds. In a qualifying exam for the International Mathematics Olympiad (IMO), GPT-4o correctly solved only 13% of problems, while the reasoning model scored 83%. Their coding abilities were evaluated in

Nicolas Bustamante

@nicbstme

a year ago

o1-preview is 100x more expensive than GPT-4o mini, costing $15 per million input tokens compared to GPT-4o mini's $0.15.
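
The ratio quoted in the tweet above can be checked with a few lines of Python. This is only a minimal sketch: the two prices are the per-million-input-token figures from the tweet, while the function name `price_ratio` and the constant names are hypothetical, chosen for illustration. Exact fractions are used so the result is not affected by floating-point rounding.

```python
from fractions import Fraction


def price_ratio(price_a: Fraction, price_b: Fraction) -> Fraction:
    """Return how many times larger price_a is than price_b."""
    return price_a / price_b


# USD per million input tokens, as quoted in the tweet (not independently verified).
O1_PREVIEW_INPUT = Fraction("15")
GPT_4O_MINI_INPUT = Fraction("0.15")

ratio = price_ratio(O1_PREVIEW_INPUT, GPT_4O_MINI_INPUT)
print(f"o1-preview costs {ratio}x as much as GPT-4o mini per input token")
```

The comparison covers input-token prices only, since those are the only figures the tweet gives.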

Zhengzhong Tu

@_vztu

a year ago

🚨Vision-Language Models (VLMs) are truly amazing. Ever wonder if their visual and textual "brains" always agree? I am excited to share our latest paper, where we tackle a critical challenge in VLMs, dubbed the 𝐜𝐫𝐨𝐬𝐬-𝐦𝐨𝐝𝐚𝐥𝐢𝐭𝐲 𝐩𝐚𝐫𝐚𝐦𝐞𝐭𝐫𝐢𝐜 𝐤𝐧𝐨𝐰𝐥𝐞𝐝𝐠𝐞

Bingxuan Li

@bingxuan_l

7 months ago

⚙️ Introducing METAL! A multi-agent framework to generate charts that precisely replicate visual details in the reference. 📈 We show that test-time scaling with the multi-agent system can bring a 5.2% gain over the current best result on ChartMIMIC! 🌐 metal-chart-generation.github.io

Yiwei Wang

@wangyiw33973985

3 months ago

Sharing a new GitHub repository for collecting and sharing papers on the emerging topic of Context Engineering, which has seen broad adoption in industry: github.com/Meirtz/Awesome… A corresponding survey paper is also coming soon. Thanks for reading! ☀️

Yiwei Wang

@wangyiw33973985

2 months ago

📄 A Survey of Context Engineering for Large Language Models 🧠 arXiv link: arxiv.org/abs/2507.13334 Context Engineering is the art of generating, acquiring, processing, and managing contextual information for language model agents. 📚 GitHub project: github.com/Meirtz/Awesome…

