
echo.hive
@hive_echo
🟣1000x Cursor course: tinyurl.com/LearnCursor 🟢 I learn and share my knowledge: echohive.live 🔴 Open Source: github.com/echohive42
ID: 1553828550909648896
https://www.echohive.live 31-07-2022 19:43:25
9.9K Tweets
10.10K Followers
722 Following


A beautiful paper from MIT, Harvard, and Google DeepMind 👏 It explains why Transformers fail at multi-digit multiplication and shows a simple bias that fixes it. The researchers trained two small Transformer models on 4-digit-by-4-digit multiplication. One used a special training method
