Randall Balestriero (@randall_balestr)'s Twitter Profile
Randall Balestriero

@randall_balestr

AI Researcher: From theory to practice (and back)
Postdoc @MetaAI with @ylecun
PhD @RiceUniversity with @rbaraniuk
Masters @ENS_Ulm @Paris_Sorbonne

ID:1246070462679040000

https://randallbalestriero.github.io

03-04-2020 13:46:59

410 Tweets

2.6K Followers

228 Following

Sara Hooker (@sarahookr)

Wei-Yin Ko just updated the arxiv preprint for this work:

arxiv.org/abs/2303.00586

Extensive additional experiments across additional datasets and settings.

Congrats to Wei-Yin Ko and Daniel D'souza for leading, with Randall Balestriero 🎉

Randall Balestriero (@randall_balestr)

Interestingly the ReLU and Swish relation is well understood from a spline viewpoint akin to the relation between k-NN and isotropic GMM:
deterministic vs probabilistic region assignment!

The same goes for absolute value vs Mish, and many more!

More at openreview.net/forum?id=Syxt2…
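
The hard vs. soft assignment idea in this tweet can be illustrated with a small numerical sketch. It assumes the two "regions" are the linear pieces {0, x}, and that the probabilistic assignment is a softmax over those pieces with an inverse-temperature `beta`. The function names and `beta` are illustrative choices made here, not taken from the linked paper, and the absolute value vs. Mish case is not covered.

```python
import numpy as np

def relu_hard(x):
    # Deterministic assignment: pick the larger of the two pieces {0, x}.
    pieces = np.stack([np.zeros_like(x), x])
    return pieces.max(axis=0)

def soft_assign(x, beta=1.0):
    # Probabilistic assignment: softmax weights over the same two pieces,
    # then a weighted average of the pieces (beta is an assumed temperature).
    pieces = np.stack([np.zeros_like(x), x])
    logits = beta * pieces
    logits = logits - logits.max(axis=0, keepdims=True)  # numerical stability
    weights = np.exp(logits)
    weights = weights / weights.sum(axis=0, keepdims=True)
    return (weights * pieces).sum(axis=0)

def swish(x, beta=1.0):
    # Swish / SiLU written directly: x * sigmoid(beta * x).
    return x / (1.0 + np.exp(-beta * x))

x = np.linspace(-3.0, 3.0, 13)
print(np.allclose(soft_assign(x), swish(x)))                # soft assignment matches Swish
print(np.allclose(soft_assign(x, beta=1e3), relu_hard(x)))  # sharp limit matches ReLU
```

Under these assumptions, the softmax-weighted average reduces algebraically to x · sigmoid(beta · x), and a large `beta` approaches the hard choice, much like a GMM with shrinking variance approaches nearest-neighbor assignment.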

Randall Balestriero (@randall_balestr)

Such an issue is hard and costly to mitigate without a clear and applicable mathematical model of Generative AI. One step in that direction was presented in
arxiv.org/abs/2110.08009
arxiv.org/abs/2203.01993
Without such a provable solution, we will always be at the mercy of a discrepancy...
