
Furong Huang
@furongh
Associate professor of @umdcs @umiacs @ml_umd at UMD. Researcher in #AI/#ML, AI #Alignment, #RLHF, #Trustworthy ML, #EthicalAI, AI #Democratization, AI for ALL.
ID: 195674678
https://furong-huang.com/ 27-09-2010 09:11:38
1.1K Tweets
7.7K Followers
2.2K Following







People are always asking for recommendations for other great content to read, but few people find that I maintain a full list of recommendations with my blog Interconnects (link to page below). Here's the list in no structured order: 1. Helen Toner, Rising Tide:

🧵When training reasoning models, what's the best approach? SFT, Online RL, or perhaps Offline RL? At KRAFTON AI and SK telecom, we've explored this critical question, uncovering interesting insights! Let’s dive deeper, starting with the basics first. 1) SFT SFT (aka hard



Huge congratulations Sanae Lotfi! I absolutely enjoyed your presentation and your dissertation!!!

🧵 New paper from AI Security Institute x EleutherAI that I led with Kyle O’Brien: Open-weight LLM safety is both important & neglected. But we show that filtering dual-use knowledge from pre-training data improves tamper resistance *>10x* over post-training baselines.


Singapore Alignment Workshop videos are live! Hear from Furong Huang, Tianwei Zhang, Jiaming Ji, Animesh Mukherjee, Weiyan Shi, Yinpeng Dong, Cassidy Laidlaw, Pin-Yu Chen, Baoyuan Wu + more.


💃🕺🪩 DISCO 🪩 🕺💃 is now accepted to EMNLP Findings. Congratulations to Yuhang Zhou and collaborators!




