
CuddlySalmon | nptacek.eth
@nptacek
ai artistry and artifice | code | VR/AR/XR
ID: 7284012
https://linktr.ee/cuddlysalmon 06-07-2007 06:05:33
40,40K Tweet
6,6K Followers
2,2K Following

I've been yapping for months about bad evaluation setups and how results/AI behaviors are reported, and this new AI Security Institute paper does so much more clearly. In short: There's a massive difference between showing a model can do something sketchy versus showing it tends to
