Bary Levy (@barylevy_)'s Twitter Profile
Bary Levy

@barylevy_

ML & Security researcher

ID: 1030371715

Joined: 23-12-2012 11:09:52

966 Tweets

165 Followers

646 Following

Bary Levy (@barylevy_)'s Twitter Profile Photo

In hindsight, this take was incredibly wrong. ChatGPT, Claude and Gemini improved so much over the past 6 months, that I find myself using them more and more every day. They allow me to do things at a speed I never thought would be possible.

Taelin (@victortaelin)'s Twitter Profile Photo

François Chollet This is what I like the most about you. There are intelligent people here that are completely oblivious to... things, in general. Like they're lost in the jungle. The log scale of o1 basically says it can find any function in exponential time. Same as, you know... brute force.

Keller Jordan (@kellerjordan0)'s Twitter Profile Photo

The reason I didn't write a proper arxiv paper for Muon is because I simply don't think there's any relationship between the ability to publish a paper with lots of good-looking results about a new optimizer, and whether that optimizer actually works. I only trust speedruns.

1a3orn (@1a3orn)'s Twitter Profile Photo

2025 is gonna be a speedrun of every single idea from decades of RL literature being applied to RL over chain-of-thought.

Bary Levy (@barylevy_)'s Twitter Profile Photo

Hot take: all skyscrapers look like this because having lots of daylight while you work is a good thing. Not because of some failed philosophical ideas.

Bary Levy (@barylevy_)'s Twitter Profile Photo

Reward hacking will likely be one of the most talked about problems in AI in the next few years. Most problems that interest us don't have an easily verifiable ground truth like in numeric math problems. The reward function needs to be 100% robust as the enormous optimization

Bary Levy (@barylevy_)'s Twitter Profile Photo

Argumentum ad governmentum: "The government does this. Therefore, it's bad." Possibly the most destructive thought process of our time. Especially when these people take power and destroy the very foundation of public health on which modern civilization is built.

will brown (@willccbb)'s Twitter Profile Photo

we need a “nanoR1” benchmark for RL post-training experimentation. fixed set of reasoning tasks covering a few domains, set a threshold below what current reasoners can easily do but non-reasoners can’t. start with any Qwen2.5 base of your choice, see how fast you can get there
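The tweet sketches a protocol: fixed reasoning tasks, an accuracy threshold between non-reasoners and current reasoners, and a timer from base model to threshold. A minimal toy harness for that loop could look like the following; `evaluate`, `run_rl_step`, and `speedrun` are all hypothetical stand-ins, not any real library API:

```python
import time

# Toy "nanoR1"-style speedrun harness. All functions here are
# placeholders invented for illustration, not a real training stack.
THRESHOLD = 60  # accuracy (%) set above what non-reasoning baselines reach

def evaluate(model):
    # Placeholder: score the model on the fixed set of reasoning tasks.
    return model["accuracy"]

def run_rl_step(model):
    # Placeholder: one RL post-training step over chain-of-thought.
    model["accuracy"] += 5
    return model

def speedrun(model, max_steps=1000):
    """Return (wall-clock seconds, steps) to first reach THRESHOLD."""
    start = time.monotonic()
    for step in range(1, max_steps + 1):
        model = run_rl_step(model)
        if evaluate(model) >= THRESHOLD:
            return time.monotonic() - start, step
    return None, None

# A base model (e.g. a Qwen2.5 checkpoint in the tweet's proposal)
# would start below the threshold; here it's just a dict with a score.
elapsed, steps = speedrun({"accuracy": 20})
```

The metric the tweet cares about is `elapsed`: two optimizers or recipes are compared by how fast they cross the same fixed bar, not by headline final accuracy.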

Harlan Stewart (@humanharlan)'s Twitter Profile Photo

It's concerning that Dario uses "MRI for AI" to mean cracking interpretability--MRI only reliably diagnoses structural problems like tumors, not problems like schizophrenia, psychopathy, depression, ADHD, etc.

I know this sounds like a nitpick, but it's important that AI
Bary Levy (@barylevy_)'s Twitter Profile Photo

Did anyone compare RL runs starting from different pre-training checkpoints? Could help determine whether scaling pre-training further is helpful
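The experiment the question implies is simple in outline: hold the RL recipe fixed, vary only the pre-training checkpoint, and compare post-RL scores. A toy sketch, where `rl_finetune` and `eval_reasoning` are invented stand-ins rather than any real API:

```python
# Toy sketch of the proposed comparison. rl_finetune and eval_reasoning
# are hypothetical stand-ins; the curve below is fabricated purely to
# make the sketch runnable, not a claim about real scaling behavior.

def rl_finetune(pretrain_tokens):
    # Stand-in: a saturating function of pre-training compute.
    return pretrain_tokens / (pretrain_tokens + 1e12)

def eval_reasoning(post_rl_score):
    # Stand-in for evaluation on a fixed reasoning benchmark.
    return round(post_rl_score, 3)

checkpoints = [1e12, 2e12, 4e12]  # pre-training token counts per checkpoint
results = {int(c): eval_reasoning(rl_finetune(c)) for c in checkpoints}
# If post-RL scores keep climbing with checkpoint size, further
# pre-training is still buying something; if they plateau, it isn't.
```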