Yuval Shalev (@yuvalshalev1)'s Twitter Profile
Yuval Shalev

@yuvalshalev1

ID: 1681775324839784448

19-07-2023 21:17:58

9 Tweets

16 Followers

100 Following

Gili Lior (@gililior)

🧠🤖 Does learning in the brain inherently require plasticity? In our latest paper we question this assumption, by leveraging insights into how LLMs "learn". Check out this thread for more details! biorxiv.org/content/10.110… w/ Yuval Shalev Gabriel Stanovsky Ariel Goldstein

Gili Lior (@gililior)

Exciting news! I'll present my poster at #ACL2024 about unsupervised document structure extraction tomorrow (Aug. 12th) at 12:45 PM 🕒 Come say hi and let's chat over the paper! arxiv.org/pdf/2402.13906 More details below ⬇️ w/ Gabriel Stanovsky (((ل()(ل() 'yoav))))👾 Ai2 HUJI NLP

Daria Lioubashevski (@darialioub)

📢Paper release📢 What computation is the Transformer performing in the layers after the top-1 becomes fixed (a so-called "saturation event")? We show that the next highest-ranked tokens also undergo saturation *in order* of their ranking. Preprint: arxiv.org/abs/2410.20210 1/4

Matan Abudy (@matanabudy)

📄 New paper: "A Minimum Description Length Approach to Regularization in Neural Networks" with Orr Well, Emmanuel Chemla, Roni Katzir, and Nur Lan. We explore why neural networks often struggle with simple structured tasks. Spoiler: our regularizers might be the problem. 🧵