Hiskias (@zikud_s) 's Twitter Profile
Hiskias

@zikud_s

Luck does exist; it exists as each of us makes it happen. AI Safety Researcher

ID: 2831910750

Link: https://portfolio-chi-liart.vercel.app/ · Joined: 25-09-2014 15:13:12

505 Tweets

372 Followers

1.1K Following

Rohan Paul (@rohanpaul_ai) 's Twitter Profile Photo

Brilliant paper from Meta having the potential to significantly boost LLMs' reasoning power.

Why force AI to explain in English when it can think directly in neural patterns?

Imagine if your brain could skip words and share thoughts directly - that's what this paper achieves
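The "think directly in neural patterns" idea can be caricatured in a few lines: instead of decoding each reasoning step to a discrete token and re-embedding it, the hidden state is fed straight back into the next step. A toy numpy sketch of that contrast (my illustration of the general idea, not Meta's actual method or code; `W` and `E` are random stand-ins for a trained model):

```python
import numpy as np

rng = np.random.default_rng(1)
d, vocab = 16, 50
W = rng.standard_normal((d, d)) / np.sqrt(d)   # toy recurrent "reasoning" map
E = rng.standard_normal((vocab, d))            # token embedding table

def step(h):
    return np.tanh(W @ h)

def latent_thoughts(h, n):
    # Continuous reasoning: the hidden state flows straight into the next step.
    for _ in range(n):
        h = step(h)
    return h

def verbalized_thoughts(h, n):
    # Language-bottlenecked reasoning: each step is snapped to the nearest
    # token embedding before continuing, discarding information.
    for _ in range(n):
        h = step(h)
        h = E[np.argmax(E @ h)]                # decode to a discrete token
    return h

h0 = rng.standard_normal(d)
```

The two trajectories diverge immediately, which is the point: the discrete bottleneck throws away most of the state at every step.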
Ethan Mollick (@emollick) 's Twitter Profile Photo

This bit of Sam Altman’s newest post is similar in tone to a post by the CEO of Anthropic & what many (not all) researchers from every lab have been saying publicly and privately.

You do not have to believe them, but I think they believe what they are saying, for what it's worth.
Ali Behrouz (@behrouz_ali) 's Twitter Profile Photo

Attention has been the key component for most advances in LLMs, but it can’t scale to long context. Does this mean we need to find an alternative? 

Presenting Titans: a new architecture with attention and a meta in-context memory that learns how to memorize at test time. Titans
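A caricature of "memory that learns how to memorize at test time": an associative memory whose weights take gradient steps on each incoming key-value pair during inference, instead of staying frozen after training. This is my toy reading of the abstract, not the Titans update rule:

```python
import numpy as np

rng = np.random.default_rng(0)
d = 8
M = np.zeros((d, d))   # memory matrix, updated during inference

def memorize(M, k, v, lr=0.5):
    # One gradient step on the reconstruction loss ||M k - v||^2.
    err = M @ k - v
    return M - lr * np.outer(err, k)

def recall(M, k):
    return M @ k

k, v = rng.standard_normal(d), rng.standard_normal(d)
k /= np.linalg.norm(k)            # unit key keeps the step size well-behaved
for _ in range(20):               # repeated exposure -> stored association
    M = memorize(M, k, v)
print(np.allclose(recall(M, k), v, atol=1e-3))  # True: the pair is stored
```

With a unit-norm key the recall error shrinks by a factor of (1 - lr) per step, so twenty steps are plenty to store the association.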
Hiskias (@zikud_s) 's Twitter Profile Photo

An AI model hiring hitmen and planning assassinations 😂 this is definitely 𝑁𝑂𝑇 gonna get outta hand. That's why we absolutely need AI red teamers to catch and prevent misuse like this before it spirals.

Jim Fan (@drjimfan) 's Twitter Profile Photo

We are living in a timeline where a non-US company is keeping the original mission of OpenAI alive - truly open, frontier research that empowers all. It makes no sense. The most entertaining outcome is the most likely.

DeepSeek-R1 not only open-sources a barrage of models but
Jiayi Pan (@jiayi_pirate) 's Twitter Profile Photo

We reproduced DeepSeek R1-Zero in the CountDown game, and it just works 

Through RL, the 3B base LM develops self-verification and search abilities all on its own 

You can experience the Aha moment yourself for < $30
Code: github.com/Jiayi-Pan/Tiny…

Here's what we learned 🧵
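In the Countdown game the model must combine given numbers with arithmetic to hit a target, so the RL reward can be a pure rule check, with no learned judge. A minimal sketch of such a verifier (my illustration; the actual reward code in the linked repo may differ):

```python
import ast
from collections import Counter

def countdown_reward(expr: str, numbers: list, target: int) -> float:
    """Return 1.0 if expr uses exactly the given numbers (each once)
    and evaluates to target, else 0.0."""
    try:
        tree = ast.parse(expr, mode="eval")
    except SyntaxError:
        return 0.0
    # Every numeric literal in the expression must match the given numbers.
    used = [n.value for n in ast.walk(tree) if isinstance(n, ast.Constant)]
    if Counter(used) != Counter(numbers):
        return 0.0
    # Whitelist arithmetic-only nodes before evaluating.
    allowed = (ast.Expression, ast.BinOp, ast.UnaryOp, ast.Constant,
               ast.Add, ast.Sub, ast.Mult, ast.Div, ast.USub)
    if not all(isinstance(n, allowed) for n in ast.walk(tree)):
        return 0.0
    return 1.0 if eval(compile(tree, "<expr>", "eval")) == target else 0.0

print(countdown_reward("(25 - 5) * 2", [25, 5, 2], 40))  # 1.0
```

Because the reward is verifiable by rule, the RL loop needs no human labels: sample a completion, score it, update.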
Andrej Karpathy (@karpathy) 's Twitter Profile Photo

We have to take the LLMs to school.

When you open any textbook, you'll see three major types of information:

1. Background information / exposition. The meat of the textbook that explains concepts. As you attend over it, your brain is training on that data. This is equivalent
Figure (@figure_robot) 's Twitter Profile Photo

Meet Helix, our in-house AI that reasons like a human. Robotics won't get to the home without a step change in capabilities. Our robots can now handle virtually any household item:

Andrej Karpathy (@karpathy) 's Twitter Profile Photo

This is interesting as a first large diffusion-based LLM. Most of the LLMs you've been seeing are ~clones as far as the core modeling approach goes. They're all trained "autoregressively", i.e. predicting tokens from left to right. Diffusion is different - it doesn't go left to
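The two decoding regimes can be contrasted with a toy loop (a hand-rolled illustration with a fake deterministic "model", not any real LLM's decoding API): the autoregressive decoder commits to tokens strictly left to right, while the masked-diffusion decoder starts fully masked and fills positions in arbitrary order over a few refinement passes.

```python
import random

VOCAB = ["the", "cat", "sat", "on", "mat"]
MASK = "<mask>"

def fake_model(context):
    # Stand-in for a trained predictor: deterministic pick based on
    # how many tokens are already filled in.
    return VOCAB[sum(t != MASK for t in context) % len(VOCAB)]

def autoregressive_decode(length):
    seq = []
    for _ in range(length):              # strictly left to right
        seq.append(fake_model(seq))
    return seq

def diffusion_decode(length, passes=3, seed=0):
    rng = random.Random(seed)
    seq = [MASK] * length                # start fully masked
    for _ in range(passes):
        masked = [i for i, t in enumerate(seq) if t == MASK]
        if not masked:
            break
        for i in rng.sample(masked, max(1, len(masked) // 2)):
            seq[i] = fake_model(seq)     # unmask positions in any order
    for i in [i for i, t in enumerate(seq) if t == MASK]:
        seq[i] = fake_model(seq)         # final pass: clear remaining masks
    return seq
```

The structural difference is the loop shape: one token appended per step versus a whole sequence refined in parallel passes.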

Siyan Zhao (@siyan_zhao) 's Twitter Profile Photo

Introducing d1🚀 — the first framework that applies reinforcement learning to improve reasoning in masked diffusion LLMs (dLLMs).

Combining masked SFT with a novel form of policy gradient algorithm, d1 significantly boosts the performance of pretrained dLLMs like LLaDA.
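"Policy gradient" in its simplest form is REINFORCE: sample an action, then nudge the policy parameters along ∇ log π(a) scaled by the reward. A toy two-armed-bandit version (nothing d1-specific; the paper's algorithm is its own variant):

```python
import math
import random

random.seed(0)
# Two-arm bandit: arm 1 pays 1.0, arm 0 pays 0.2.
theta = 0.0                        # logit for picking arm 1

def pick(theta):
    p1 = 1 / (1 + math.exp(-theta))
    return (1 if random.random() < p1 else 0), p1

for _ in range(500):
    a, p1 = pick(theta)
    reward = 1.0 if a == 1 else 0.2
    grad = a - p1                  # d log pi(a) / d theta for a Bernoulli policy
    theta += 0.1 * reward * grad   # REINFORCE ascent step

p1 = 1 / (1 + math.exp(-theta))    # policy now strongly prefers the better arm
```

d1 applies this same score-function idea to masked diffusion LLMs, where the "action" is a generated completion and the reward scores its reasoning.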
Yuchen Jin (@yuchenj_uw) 's Twitter Profile Photo

I saw a guy coding today. Tab 1 ChatGPT. Tab 2 Gemini. Tab 3 Claude. Tab 4 Grok. Tab 5 DeepSeek. He asked every AI the same exact question. Patiently waited, then pasted each response into 5 different Python files. Hit run on all five. Picked the best one. Like a psychopath. It's

Thomas Wolf (@thom_wolf) 's Twitter Profile Photo

we've seen nothing yet! hosted a 9-13 yo vibe-coding event w. Robert Keus 👨🏼‍💻 this w-e (h/t Anton Osika – eu/acc Lovable Build) takeaway? AI is unleashing a generation of wildly creative builders beyond anything I'd have imagined and they grow up *knowing* they can build anything!

Benjamin Todd (@ben_j_todd) 's Twitter Profile Photo

Why can AIs code for 1h but not 10h?

A simple explanation: if there's a 10% chance of error per 10min step (say), the success rate is:

1h: 53%
4h: 8%
10h: 0.002%

Toby Ord has tested this 'constant error rate' theory and shown it's a good fit for the data

chance of
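The numbers above are just compounding: with a 90% per-step success rate and 10-minute steps, an n-step task succeeds with probability 0.9^n, so 1h (6 steps) ≈ 53%, 4h (24 steps) ≈ 8%, and 10h (60 steps) ≈ 0.0018, i.e. about 0.2% (the tweet's "0.002%" reads like the raw probability with a stray percent sign). A quick check:

```python
def success_rate(hours, p_step=0.9, step_minutes=10):
    """Probability of finishing a task under a constant per-step error rate."""
    steps = hours * 60 // step_minutes
    return p_step ** steps

for h in (1, 4, 10):
    print(f"{h:>2}h: {success_rate(h):.4f}")
# prints  1h: 0.5314 /  4h: 0.0798 / 10h: 0.0018
```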