Ethan Perez
@ethanjperez
Large language model safety
ID: 908728623988953089
https://scholar.google.com/citations?user=za0-taQAAAAJ
15-09-2017 16:26:02
1.1K Tweets
7.7K Followers
507 Following
Excited to share that our 🌈 Rainbow Teaming method has been used to evaluate and enhance the adversarial robustness of Llama 3.1 models! Originally an exploratory project co-led with Andrei Lupu and Sharath Raparthy, it has contributed to the biggest release of the year!
John Schulman Very excited to be working together again!
This is a tough call and will make some people upset, but, all things considered, I think California should probably pass the SB 1047 AI safety bill. For over 20 years, I have been an advocate for AI regulation, just as we regulate any product/technology that is a potential risk