Lucas Uzal
@lucas_uzal
PhD in Physics. Former Professor. Co-founder of Teramot. Programmed my first neural network in C from scratch in 2005. Following AI progress since then.
ID: 1170513098651983872
08-09-2019 01:45:07
1.1K Tweets
375 Followers
1.1K Following
The (true) story of development and inspiration behind the "attention" operator, the one in "Attention Is All You Need" that introduced the Transformer. From personal email correspondence with the author 🇺🇦 Dzmitry Bahdanau @ NeurIPS ~2 years ago, published here and now (with permission) following