Andy J Yang (@pentagonalize)'s Twitter Profile
Andy J Yang

@pentagonalize

ID: 1262574885357772801

Joined: 19-05-2020 02:45:18

170 Tweets

121 Followers

917 Following

William Merrill (@lambdaviking)'s Twitter Profile Photo

📜New preprint w/ Noah A. Smith and Yanai Elazar that evaluates the novelty of LM-generated text using our n-gram search tool Rusty-DAWG 🐶 Code: github.com/viking-sudo-rm… Paper: arxiv.org/abs/2406.13069
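The linked paper and repo carry the details, but the core measurement is easy to sketch. Below is a hypothetical brute-force illustration of n-gram novelty, i.e., the fraction of a generation's n-grams that appear verbatim in the training corpus; Rusty-DAWG itself uses a compressed DAWG index in Rust rather than anything like this, and the function names here are made up.

```python
# Brute-force sketch of n-gram novelty (illustration only; the real tool
# builds a compressed DAWG index so LM-training-scale corpora are feasible).

def ngrams(tokens, n):
    """All n-grams of a token list, as a set of tuples."""
    return {tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1)}

def ngram_overlap(corpus_tokens, generated_tokens, max_n=8):
    """Fraction of generated n-grams found verbatim in the corpus, per n."""
    overlap = {}
    for n in range(1, max_n + 1):
        gen = [tuple(generated_tokens[i:i + n])
               for i in range(len(generated_tokens) - n + 1)]
        if not gen:
            break
        corpus_set = ngrams(corpus_tokens, n)
        overlap[n] = sum(g in corpus_set for g in gen) / len(gen)
    return overlap

corpus = "the cat sat on the mat".split()
generated = "the cat sat on a rug".split()
print(ngram_overlap(corpus, generated, max_n=4))
# Low overlap at large n suggests novel text; high overlap suggests copying.
```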

David Chiang (@davidweichiang)'s Twitter Profile Photo

Congratulations to ctaguchi for winning a Lucy Family Institute for Data & Society Societal Impact Award for his work on creating language technologies for the Kichwa language community in Ecuador! Notre Dame CSE youtube.com/watch?v=HjsNOE…

Naomi Saphra hiring a lab 🧈🪰 (@nsaphra)'s Twitter Profile Photo

What makes some LM interpretability research “mechanistic”? In our new position paper in BlackboxNLP, Sarah Wiegreffe and I argue that the practical distinction was never technical, but a historical artifact that we should be—and are—moving past to bridge communities.

Boycraft (@boycraf19492179)'s Twitter Profile Photo

Continuing with Fab. I converted all the Tamakoro transformations in one batch. Deciding on a theme before working might be better for motivation. With this, about a quarter is done, I think. Since there are so many rhinoceros and stag beetles, it might be better to group them into a single piece like a specimen display~

Yikang Shen (@yikang_shen)'s Twitter Profile Photo

Stick-Breaking Attention: out-of-the-box length extrapolation, thanks to removing the position embedding; better performance than Softmax+RoPE on almost every task; and an efficient implementation similar to Flash Attention. Do we still need Softmax+RoPE for language models?
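A minimal numpy sketch of the stick-breaking idea as the abstract describes it: scanning from the most recent key backwards, each key takes a sigmoid-gated fraction of the attention mass that remains, so a built-in recency bias replaces the position embedding. This is an assumption-laden toy for a single query, not the paper's fused Flash-Attention-style kernel.

```python
import numpy as np

def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))

def stick_breaking_attention(q, K, V):
    """Toy stick-breaking attention for one query over earlier keys.

    Keys are ordered oldest-first. Scanning newest-first, key j receives
    beta_j times the stick left over after all newer keys took their share:
        w_j = beta_j * prod_{k > j} (1 - beta_k),  beta_j = sigmoid(q . k_j)
    No softmax is involved, and no position embedding is needed.
    """
    betas = sigmoid(K @ q)               # one gate per key
    weights = np.zeros_like(betas)
    remaining = 1.0                      # unallocated attention mass
    for j in reversed(range(len(betas))):
        weights[j] = betas[j] * remaining
        remaining *= 1.0 - betas[j]      # weights sum to <= 1, not exactly 1
    return weights @ V

rng = np.random.default_rng(0)
q, K, V = rng.normal(size=4), rng.normal(size=(6, 4)), rng.normal(size=(6, 8))
print(stick_breaking_attention(q, K, V))
```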

Michael Hahn (@mhahn29)'s Twitter Profile Photo

When do transformers length-generalize? Generalizing to sequences longer than seen during training is a key challenge for transformers. Some tasks see success, others fail — but *why*? We introduce a theoretical framework to understand and predict length generalization.

Samuel Cahyawijaya (@scahyawijaya)'s Twitter Profile Photo

❓ Thank You, Stingray ❓ Today's LLMs are powerful, with strong general problem-solving capabilities spanning multiple languages. But can LLMs disambiguate semantic meaning across different languages? Our new paper delves deeper to answer exactly this question!

Maziyar PANAHI (@maziyarpanahi)'s Twitter Profile Photo

The World's First Transformer ASIC. You ask, “What would you do with 500,000 tokens per second?” Build real AI applications! Current inference limitations hold us back from doing anything beyond immediately generating the first response for users. etched.com

South Bend Tribune (@sbtribune)'s Twitter Profile Photo

A new gate was needed at the South Bend airport, so officials decided to upgrade the business center at the same time. southbendtribune.com/story/news/202…

Yash Sarrof (@yashyrs)'s Twitter Profile Photo

First-time NeurIPS attendee here! Super excited to talk about our paper with Yana Veitsman, Michael Hahn and to discover the amazing work by everyone else :D neurips.cc/virtual/2024/p…

Satwik Bhattamishra (@satwik1729)'s Twitter Profile Photo

Excited to head to NeurIPS Conference today! I'll be presenting our work on the representational capabilities of Transformers and RNNs/SSMs. If you're interested in meeting up to discuss research or chat, feel free to reach out via DM or email!

David Chiang (@davidweichiang)'s Twitter Profile Photo

Drop by Andy J Yang's poster tomorrow on the relationship between transformers and first-order logic! neurips.cc/virtual/2024/p… Wed 4:30-7:30 East Exhibit Hall A-C #2310

Tiago Pimentel (@tpimentelms)'s Twitter Profile Photo

BPE is a greedy method to find a tokeniser which maximises compression! Why don't we try to find properly optimal tokenisers instead? Well, it seems this is a very difficult—in fact, NP-complete—problem!🤯 New paper + P. Whittington, Gregor Bachmann :) arxiv.org/abs/2412.15210
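For readers wondering what "greedy" means here: BPE repeatedly merges whichever adjacent symbol pair is currently most frequent, with no lookahead, so each merge is locally best for compression while the resulting vocabulary need not be globally optimal; that globally optimal search is what the paper shows to be NP-complete. A minimal sketch of the training loop (illustrative only, not any particular library's implementation):

```python
from collections import Counter

def bpe_train(tokens, num_merges):
    """Greedy BPE training: repeatedly merge the most frequent adjacent pair."""
    merges = []
    for _ in range(num_merges):
        pairs = Counter(zip(tokens, tokens[1:]))
        if not pairs:
            break
        pair = max(pairs, key=pairs.get)     # the greedy step: best pair *now*
        merges.append(pair)
        merged, i = [], 0
        while i < len(tokens):               # rewrite the sequence with the merge applied
            if i + 1 < len(tokens) and (tokens[i], tokens[i + 1]) == pair:
                merged.append(tokens[i] + tokens[i + 1])
                i += 2
            else:
                merged.append(tokens[i])
                i += 1
        tokens = merged
    return merges, tokens

merges, compressed = bpe_train(list("abababcab"), num_merges=2)
print(merges)       # [('a', 'b'), ('ab', 'ab')]
print(compressed)   # ['abab', 'ab', 'c', 'ab']
```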

Nauseam (@chadnauseam)'s Twitter Profile Photo

"A calculator app? Anyone could make that." Not true. A calculator should show you the result of the mathematical expression you entered. That's much, much harder than it sounds. What I'm about to tell you is the greatest calculator app development story ever told.

"A calculator app? Anyone could make that."

Not true.

A calculator should show you the result of the mathematical expression you entered. That's much, much harder than it sounds.

What I'm about to tell you is the greatest calculator app development story ever told.