Ted (@tedinreallife)'s Twitter Profile
Ted

@tedinreallife

still on pace for PhD-before-50

ID: 2732174480

Joined: 14-08-2014 16:36:01

1.1K Tweets

120 Followers

556 Following

Glenn K. Lockwood (@glennklockwood)'s Twitter Profile Photo

The AI world is in a GPU crunch and meanwhile NERSC is offering 50% off its A100 GPUs (rest.nersc.gov/REST/announcem…). They could make a killing by backfilling idle capacity with commercial workloads 💰 💰 💰

Soumith Chintala (@soumithchintala)'s Twitter Profile Photo

Regulation starts at roughly two orders of magnitude larger than a ~70B Transformer trained on 2T tokens -- which is ~5e24. Note: increasing the size of the dataset OR the size of the transformer increases training flops. The (rumored) size of GPT-4 is regulated.
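For context, a minimal back-of-envelope sketch using the standard FLOPs ≈ 6·N·D approximation (N = parameters, D = training tokens). This rule of thumb is an assumption here and not necessarily the accounting behind the tweet's ~5e24 figure:

```python
# Rule-of-thumb training compute for a dense Transformer:
# total FLOPs ~= 6 * N * D, with N = parameters, D = training tokens.
def training_flops(n_params: float, n_tokens: float) -> float:
    return 6.0 * n_params * n_tokens

base = training_flops(70e9, 2e12)            # ~70B params on 2T tokens
print(f"base run:  {base:.1e} FLOPs")        # ~8.4e23
print(f"100x base: {100 * base:.1e} FLOPs")  # ~8.4e25, i.e. near the
# 1e26-FLOP thresholds discussed in compute-based AI regulation.
```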

Christian Szegedy (@chrszegedy)'s Twitter Profile Photo

Inception used 1.5X less compute than AlexNet and 12X less than VGG, outperforming both. The trend continued with MobileNet... etc. IMO, today's LLMs are insanely inefficient per unit of compute. Regulations that impose limits on the amount of compute spent on AI training will just

Eliezer Yudkowsky ⏹️ (@esyudkowsky)'s Twitter Profile Photo

Me: Can you draw a very normal image? ChatGPT: Here is a very normal image depicting a tranquil suburban street scene during the daytime. Me: Not bad, but can you go more normal than that? (cont.)

Prof. Anima Anandkumar (@animaanandkumar)'s Twitter Profile Photo

How do we capture local features across multiple resolutions? While standard convolutional layers work only on a fixed input-resolution, we design local neural operators that learn integral and differential kernels, and are principled ways to extend standard convolutions to

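A minimal sketch of the rescaling idea behind such layers, under the assumption that "integral" weights scale with the grid spacing h (so the sum approximates an integral) while zero-mean "differential" stencils scale with 1/h (so they approximate a derivative). `LocalOperator1D` is hypothetical, not the authors' implementation:

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class LocalOperator1D(nn.Module):
    """Toy resolution-aware local layer: the same learned stencils
    target a fixed continuous operator at any input resolution."""
    def __init__(self, width: int = 5):
        super().__init__()
        self.w_int = nn.Parameter(torch.randn(1, 1, width) * 0.1)
        self.w_diff = nn.Parameter(torch.randn(1, 1, width) * 0.1)

    def forward(self, u: torch.Tensor, h: float) -> torch.Tensor:
        # u: (batch, 1, n) samples of a function on a grid with spacing h
        pad = self.w_int.shape[-1] // 2
        integral = F.conv1d(u, self.w_int, padding=pad) * h
        stencil = self.w_diff - self.w_diff.mean()   # zero-mean stencil
        derivative = F.conv1d(u, stencil, padding=pad) / h
        return integral + derivative

# The same module applied to one function sampled at two resolutions:
op = LocalOperator1D()
u64 = torch.sin(torch.linspace(0, 6.28, 64)).view(1, 1, -1)
u256 = torch.sin(torch.linspace(0, 6.28, 256)).view(1, 1, -1)
y64, y256 = op(u64, h=6.28 / 63), op(u256, h=6.28 / 255)
```

A plain convolution would compute a different operator at each resolution; the h and 1/h scalings are what let one set of weights behave consistently across grids.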
Bojan Tunguz (@tunguz)'s Twitter Profile Photo

And today is T - 2 weeks for my other @NVIDIA #GTC session - a fireside chat with Christian Szegedy, cofounder of @xAI. Christian is one of the seminal research figures in the Deep Learning community, but the main focus of our chat will be on something that he has been working very

Damien Teney (@damienteney)'s Twitter Profile Photo

Why do neural nets generalize so well?🤔 There's a ton of work on SGD, flat minima, ... but the root cause is that their inductive biases somehow match properties of real-world data.🌎 We've examined these inductive biases in *untrained* networks.🎲 arxiv.org/abs/2403.02241 ⬇️

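A toy probe of the same question: sample many *untrained* ReLU nets and measure how wiggly the functions they compute are. This is far simpler than the paper's methodology and only illustrates the "inspect inductive biases before training" idea; the sign-change count is a crude, hypothetical complexity proxy:

```python
import numpy as np

rng = np.random.default_rng(0)

def random_relu_net(widths):
    """Sample an untrained ReLU MLP with Gaussian init."""
    return [(rng.normal(0, 1 / np.sqrt(fi), (fo, fi)),
             rng.normal(0, 0.1, (fo, 1)))
            for fi, fo in zip(widths[:-1], widths[1:])]

def forward(params, x):
    h = x
    for W, b in params[:-1]:
        h = np.maximum(W @ h + b, 0)
    W, b = params[-1]
    return W @ h + b

# Evaluate many untrained nets on a 1D slice and count sign changes
# around the mean: random nets tend to realize smooth, low-frequency
# functions rather than arbitrary ones.
xs = np.linspace(-3, 3, 512).reshape(1, -1)
counts = []
for _ in range(200):
    ys = forward(random_relu_net([1, 64, 64, 1]), xs).ravel()
    counts.append(int((np.diff(np.sign(ys - ys.mean())) != 0).sum()))
print("median sign changes across 200 untrained nets:",
      int(np.median(counts)))
```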
Ted (@tedinreallife)'s Twitter Profile Photo

Mechanism for feature learning in neural networks and backpropagation-free machine learning models | Science science.org/doi/10.1126/sc…

Zhenhailong Wang (@zhenhailongw)'s Twitter Profile Photo

Large multimodal models often lack precise low-level perception needed for high-level reasoning, even with simple vector graphics. We bridge this gap by proposing an intermediate symbolic representation that leverages LLMs for text-based reasoning. mikewangwzhl.github.io/VDLM 🧵1/4
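A toy sketch of the pipeline shape (vector graphics -> symbolic text -> LLM prompt). The `svg_to_symbolic` function and its output format are hypothetical stand-ins, not VDLM's actual intermediate representation:

```python
import xml.etree.ElementTree as ET

def svg_to_symbolic(svg_text: str) -> str:
    """Flatten SVG primitives into plain-text facts that a text-only
    LLM can reason over (toy stand-in for a symbolic representation)."""
    ns = "{http://www.w3.org/2000/svg}"
    facts = []
    for el in ET.fromstring(svg_text).iter():
        tag = el.tag.replace(ns, "")
        if tag == "circle":
            facts.append(f"circle(cx={el.get('cx')}, cy={el.get('cy')}, "
                         f"r={el.get('r')})")
        elif tag == "rect":
            facts.append(f"rect(x={el.get('x')}, y={el.get('y')}, "
                         f"w={el.get('width')}, h={el.get('height')})")
    return "\n".join(facts)

svg = ('<svg xmlns="http://www.w3.org/2000/svg">'
       '<circle cx="10" cy="10" r="5"/>'
       '<rect x="0" y="0" width="4" height="4"/></svg>')
prompt = ("Given these shapes:\n" + svg_to_symbolic(svg) +
          "\nWhich shape encloses more area?")
# `prompt` would then go to a text-only LLM for the reasoning step.
print(prompt)
```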