Josh McGrath (@j_mcgraph) Twitter Tweets • TwiCopy

Effie Klimi

6 months ago

“So. we stack layers where each does Wx + b followed by a nonlinearity e.g. ReLU. This builds a deep function. We train it by minimizing loss using backpropagation and gradient descen-“

thumb_up_off_alt4,4K

chat_bubble_outline19

repeat364

shareShare

Josh McGrath

@j_mcgraph

6 months ago

My boss is cool btw you should come report to her and build cool stuff

thumb_up_off_alt9

chat_bubble_outline0

repeat0

shareShare

Josh McGrath

@j_mcgraph

6 months ago

Real

thumb_up_off_alt1

chat_bubble_outline0

repeat0

shareShare

notable imo that the world's most valuable companies *are not* casinos, slop mobile games, or even tiktok/youtube instead they're apple (tools that ehance creativity), meta (originally about connecting people), and nvidia (enabling ai to accelerate science)

thumb_up_off_alt582

chat_bubble_outline50

repeat24

shareShare

Josh McGrath

@j_mcgraph

6 months ago

If you’re working in quant and are tired of the lack of meaning in your work, we’re always hiring! I promise you, we have more than enough compute. P.S. we’re chaotic because building anything new is messy. Do it with us!

thumb_up_off_alt197

chat_bubble_outline5

repeat3

shareShare

shyamal

@shyamalanadkat

6 months ago

getting started with evals doesn't require too much. the pattern that we've seen work for small teams looks a lot like test‑driven development applied to AI engineering: 1/ anchor evals in user stories, not in abstract benchmarks: sit down with your product/design counterpart

thumb_up_off_alt199

chat_bubble_outline4

repeat27

shareShare

Josh McGrath

@j_mcgraph

6 months ago

I think we’re in a timeline where > US builds general robots for manufacturing in the next two years > doesn’t matter because of our land use regulation

thumb_up_off_alt1

chat_bubble_outline0

repeat0

shareShare

Josh McGrath

@j_mcgraph

6 months ago

can’t believe we got jony just to fix our model naming scheme

thumb_up_off_alt10

chat_bubble_outline1

repeat0

shareShare

rapha

@rapha_gl

6 months ago

opus not being great at benchmarks (but having really good user testimonials) is further confirmation of the deep deep eval crisis we’re in you can max out any benchmark with enough RL, and that doesn’t translate into a good product. you can optimize for DAUs and glaze-hack it

thumb_up_off_alt152

chat_bubble_outline5

repeat10

shareShare

dinos

@din0s_

6 months ago

another banger hit the TL

thumb_up_off_alt34

chat_bubble_outline2

repeat4

shareShare

Josh McGrath

@j_mcgraph

6 months ago

Now that this is out I can finally say that we were building a vape but it turned out to be too dangerous, the tricks we prompted it to do were too sick.

thumb_up_off_alt6

chat_bubble_outline0

repeat0

shareShare

Miles Brundage

@miles_brundage

6 months ago

thumb_up_off_alt229

chat_bubble_outline10

repeat12

shareShare

Jason Wei

@_jasonwei

6 months ago

A recent clarity that I gained is viewing AI research as a “max-performance domain”, which means that you can be world-class by being very good at only one part of your job. As long as you can create seminal impact (e.g., train the best model, start a new paradigm, or create

thumb_up_off_alt654

chat_bubble_outline29

repeat56

shareShare

Josh McGrath

@j_mcgraph

6 months ago

From the same admin trying to cancel nuclear subsidies which could be powering the datacenters lol

thumb_up_off_alt3

chat_bubble_outline0

repeat0

shareShare

Josh McGrath

@j_mcgraph

6 months ago

I'm a bit skeptical of the papers mentioned below, but I think it's a good thing to rerun the eval yourself and report whatever number you get. It's hard to replicate someones eval setup!

thumb_up_off_alt2

chat_bubble_outline0

repeat0

shareShare

Josh McGrath

@j_mcgraph

6 months ago

Me after using more than one pipe in a command

thumb_up_off_alt2

chat_bubble_outline0

repeat1

shareShare

Josh McGrath

@j_mcgraph

6 months ago

The admin wants to be good at AI until it means either recognizing > foreign talent > forms of energy they don’t find masculine or something idk

thumb_up_off_alt17

chat_bubble_outline1

repeat0

shareShare

Joshua Achiam

@jachiam0

6 months ago

It feels somewhat astonishing that a third of the strategic bomber fleet of a great power can now be taken out in a daring drone attack - the foundations of war appear to be changing at speed. I don't know what this will lead to.

thumb_up_off_alt96

chat_bubble_outline9

repeat6

shareShare

Duncan S. Campbell

@duncan__c

6 months ago

We must manufacture batteries at scale here in the US if we want to have any shot at the very near future of defense, robotics, mobility, and energy. Basically everything that matters right now goes back to batteries.

thumb_up_off_alt515

chat_bubble_outline22

repeat67

shareShare

Josh McGrath

@j_mcgraph

5 months ago

No, it’s like if neuroscience had unlimited resolution imaging and perfect intervention. You know, two key bottlenecks in that field.

thumb_up_off_alt7

chat_bubble_outline1

repeat0

shareShare

Josh McGrath

Effie Klimi

Josh McGrath

Josh McGrath

Aidan McLaughlin

Josh McGrath

shyamal

Josh McGrath

Josh McGrath

rapha

dinos

Josh McGrath

Miles Brundage

Jason Wei

Josh McGrath

Josh McGrath

Josh McGrath

Josh McGrath

Joshua Achiam

Duncan S. Campbell

Josh McGrath