Genglin Liu (@genglin_liu) Twitter Tweets • TwiCopy

Dan Hendrycks

a year ago

As an alternative to RLHF and adversarial training, we released short-circuiting. It makes models ~100x more robust. It works for LLMs, multimodal models, and agents. Unlike before, I now think robustly stopping models from generating harmful outputs may be highly tractable and

thumb_up_off_alt619

chat_bubble_outline25

repeat93

shareShare

Heng Ji

@hengjinlp

a year ago

We have won two NAACL2024 Outstanding Paper Awards! Congratulations to Chi Han, Shizhe Diao, Yi Fung, Xingyao Wang, Yangyi Chen and all students and collaborators! Chi Han Chi Han will be on academic job market next year! arxiv.org/pdf/2308.16137 arxiv.org/pdf/2311.09677

thumb_up_off_alt224

chat_bubble_outline17

repeat13

shareShare

Shizhe Diao

@shizhediao

a year ago

Excited to share our R-Tuning got an outstanding paper award@NAACL 2024! Take a look at this paper to see how to align your LLMs to honesty. arxiv.org/abs/2311.09677 This work is finished during my visit at UIUC. Thanks for Prof. Ji and Prof. Zhang’s supervision!

thumb_up_off_alt77

chat_bubble_outline12

repeat10

shareShare

Chi Han

@glaciohound

a year ago

🎖 Excited to receive an outstanding paper award at NAACL2024 for LM-Infinite "Zero-Shot Extreme Length Generalization for Large Language Models" work! We extend to 200M length with no parameter updates, with downstream improvements arxiv.org/abs/2308.16137 github.com/Glaciohound/LM…

thumb_up_off_alt48

chat_bubble_outline5

repeat7

shareShare

Zeyuan Allen-Zhu, Sc.D.

@zeyuanallenzhu

a year ago

If you're attending ICML 2024, join my 2-hour tutorial on Monday July 22 to explore the Physics of Language Model - all 6 parts. Visit: physics.allen-zhu.com and it will be live-streamed on Zoom. BONUS: this is the premiere of Part 2.1 + 2.2, don't miss out! #ICML2024 #MetaAI

thumb_up_off_alt872

chat_bubble_outline18

repeat168

shareShare

Haoyi Qiu

@haoyiqiu

a year ago

🌐 Are LLM agents prepared to navigate the rich diversity of cultural and social norms? 🏠 CASA tests them on real-world tasks like online shopping and social discussion forums, revealing that current agents show less than 10% awareness and over 40% norm violations. 🧠 We’re

thumb_up_off_alt127

chat_bubble_outline4

repeat33

shareShare

Yu (Bryan) Zhou

@yu_bryan_zhou

a year ago

📢 A single line of code to thoroughly evaluate your LLM for Embodied Decision Making 📢 Please checkout our new NeurIPS D&B Oral Paper!! (Part-1 of my summer intern works Stanford Vision and Learning Lab)

thumb_up_off_alt39

chat_bubble_outline0

repeat4

shareShare

uclanlp

@uclanlp

a year ago

UCLANLP and alumni at #EMNLP2024 social event. What a group!

thumb_up_off_alt85

chat_bubble_outline0

repeat10

shareShare

Liwei Jiang

@liweijianglw

10 months ago

I'm thrilled to share that our Delphi paper is officially published today at Nature Machine Intelligence after almost four years of hard works from all my amazing collaborators (a quite insane timeline considering the rapid AI world)! Special thanks to the unwavering support of my advisor,

I'm thrilled to share that our Delphi paper is officially published today at <a href="/NatMachIntell/">Nature Machine Intelligence</a> after almost four years of hard works from all my amazing collaborators (a quite insane timeline considering the rapid AI world)! Special thanks to the unwavering support of my advisor,

thumb_up_off_alt181

chat_bubble_outline11

repeat27

shareShare

Zhenhailong Wang

@zhenhailongw

10 months ago

📱Current mobile agents struggle with real-world tasks that align with human needs—like finding the best deal across 3 apps. 💸 Introducing Mobile-Agent-E: a novel mobile assistant designed for complex, long-horizon tasks and capable of self-evolving🐣🐥through experience. 🧵1/3

thumb_up_off_alt29

chat_bubble_outline1

repeat9

shareShare

Yuji Zhang

@yuji_zhang_nlp

9 months ago

🔍New findings of knowledge overshadowing! Why do LLMs hallucinate over all true training data? 🤔Can we predict hallucinations even before model training or inference? 🚀Check out our new preprint: [arxiv.org/pdf/2502.16143] The Law of Knowledge Overshadowing: Towards

thumb_up_off_alt123

chat_bubble_outline6

repeat34

shareShare

Salman

@salman1422571

7 months ago

🚨 Excited to share our new paper on 𝕏-Teaming! 🤖 Multiagent system for multiturn jaibreaking 🔍 96.2% attack success against Claude 3.7 (immune to single-turn attacks!) 💥 Upto 98.1% attack success on leading model 🛡️ Released 30K safety dataset 🧵below #AI #LLMSafety

thumb_up_off_alt37

chat_bubble_outline2

repeat10

shareShare

Salman

@salman1422571

5 months ago

🚨Thrilled to share our new work: AI debate combats misinformation better than single AI advisors! 🤔We tested if two AIs debating opposite sides helps biased humans judge controversial COVID-19 claims more accurately. Paper: arxiv.org/abs/2506.02175 🧵👇 #AI #Debate

thumb_up_off_alt14

chat_bubble_outline1

repeat7

shareShare

Liwei Jiang

@liweijianglw

2 months ago

Wondering whether AI debates can drive biased perspectives toward truth? Our answer is YES and this scalable oversight work is now accepted to #NeurIPS2025 ! Finally bringing a large-scale human study into an AI conference! (+++ my first time as a last-ish author is very fun!

thumb_up_off_alt43

chat_bubble_outline1

repeat6

shareShare

Liwei Jiang

@liweijianglw

2 months ago

(Thu Oct 9, 11:00am–1:00pm) Poster Session 5 𝐏𝐨𝐬𝐭𝐞𝐫 #𝟒𝟒: X-Teaming: Multi-Turn Jailbreaks and Defenses with Adaptive Multi-Agents; w/ amazing co-leads Salman James Shiffer In this work, we introduce a 𝐜𝐨𝐦𝐩𝐫𝐞𝐡𝐞𝐧𝐬𝐢𝐯𝐞 and 𝐞𝐚𝐬𝐲-𝐭𝐨-𝐫𝐮𝐧

(Thu Oct 9, 11:00am–1:00pm) Poster Session 5

𝐏𝐨𝐬𝐭𝐞𝐫 #𝟒𝟒: X-Teaming: Multi-Turn Jailbreaks and Defenses with Adaptive Multi-Agents; w/ amazing co-leads <a href="/salman1422571/">Salman</a> <a href="/jamesnshiffer/">James Shiffer</a>

In this work, we introduce a 𝐜𝐨𝐦𝐩𝐫𝐞𝐡𝐞𝐧𝐬𝐢𝐯𝐞 and 𝐞𝐚𝐬𝐲-𝐭𝐨-𝐫𝐮𝐧

thumb_up_off_alt5

chat_bubble_outline0

repeat4

shareShare