Jeremy Pinto (@jerpint) Twitter Tweets • TwiCopy

Jeremy Pinto

@jerpint

2 months ago

This happened to me once. Candidate gave me a perfect pandas one liner (including knowing obscure param names) but couldn’t tell me what df.head() would do

Benno Krojer

Tomás Vergara Browne and I were quiet over the summer with our podcast "Behind the Research of AI"...

But now we're back! And with an awesome guest!

We interviewed Jack Morris during Conference on Language Modeling and had a blast chatting, eating snacks together and reflecting on PhD life/research ideas

Jeremy Pinto

@jerpint

2 months ago

Claude is trying to use “sudo”. Do you want to proceed?

Greg Kamradt

@gregkamradt

2 months ago

We verified the TRM results on the semi private holdout set and they're legit

Awesome work and contribution to the open source community by Alexia Jolicoeur-Martineau

My notes:
* This model is tiny! 7M params, but the rub is that it is relatively expensive to run because pre-training and

Alexia Jolicoeur-Martineau

@jm_alexia

2 months ago

The future of AI doesn't have to break the bank and destroy the environment to reach AGI!

terminal

@terminaldotshop

2 months ago

Is this our future?

Andrej Karpathy

@karpathy

2 months ago

My pleasure to come on Dwarkesh last week, I thought the questions and conversation were really good. I re-watched the pod just now too. First of all, yes I know, and I'm sorry that I speak so fast :). It's to my detriment because sometimes my speaking thread out-executes my

Jeremy Pinto

@jerpint

2 months ago

GPTards needs to become a thing

Andrej Karpathy

@karpathy

2 months ago

Nice, short post illustrating how simple text (discrete) diffusion can be. Diffusion (i.e. parallel, iterated denoising, top) is the pervasive generative paradigm in image/video, but autoregression (i.e. go left to right bottom) is the dominant paradigm in text. For audio I've

Thariq

@trq212

2 months ago

We launched a sandbox within Claude Code that allows you to define exactly which directories and network hosts your agent can access. Type /sandbox to enable it.

Jeremy Pinto

@jerpint

2 months ago

Just take screenshots of your .md files, problem solved

NVIDIA

@nvidia

2 months ago

Space isn’t just for stars anymore. 🌠 Starcloud’s H100-powered satellite brings sustainable, high-performance computing beyond Earth. Learn more: nvda.ws/47eYZvC

Jeremy Pinto

@jerpint

2 months ago

It feels like coding models would be so much more efficient and faster if they had access to "ctrl-c" and "ctrl-v"

Jeremy Pinto

@jerpint

2 months ago

Forget cloud compute, now we have stratospheric compute

Jeremy Pinto

@jerpint

2 months ago

1 c

Jeremy Pinto

@jerpint

2 months ago

🛥️

Jeremy Pinto

@jerpint

2 months ago

🚢

Jeremy Pinto

@jerpint

2 months ago

💀

Alexia Jolicoeur-Martineau

@jm_alexia

2 months ago

Insane finding! You train on at most 16 improvement steps at training, but at inference you do as many steps as possible (448 steps) and you reach crazy accuracy. This is how you build intelligence!!

Jeremy Pinto

@jerpint

2 months ago

I hope your Monday is going as well as mine 🙏
