Mike Knoop (@mikeknoop) 's Twitter Profile
Mike Knoop

@mikeknoop

co-founder @zapier and @arcprize

ID: 57444441

linkhttps://mikeknoop.com calendar_today16-07-2009 20:57:15

3,3K Tweet

11,11K Followers

283 Following

Mike Knoop (@mikeknoop) 's Twitter Profile Photo

Sergey Brin: > the algorithmic improvements in AI models over the past few years are outpacing the amount of compute put in youtu.be/XzK9bx3CSPE?si… at 8:45

Mike Knoop (@mikeknoop) 's Twitter Profile Photo

The techniques currently on top of the ARC-AGI leaderboards are similar to those used by DeepMind's AlphaProof which got the IMO silver medal. And we still have a long way to go! The design of contest is to make the code open source and reproducible by anyone.

Mike Knoop (@mikeknoop) 's Twitter Profile Photo

SB 1047 has become more symbol than policy because its impact is so watered down to be ~null on safety. While I disagree with the spirit of 1047 then and now, at least early versions spurred curiosity and debate. IMO everyone is worse off with whose-list-is-bigger contests.

ARC Prize (@arcprize) 's Twitter Profile Photo

One goal for ARC Prize was to provide a public measure of progress towards AGI. Here's what we see now when new models like o1 come out.

One goal for ARC Prize was to provide a public measure of progress towards AGI.

Here's what we see now when new models like o1 come out.
Mike Knoop (@mikeknoop) 's Twitter Profile Photo

> In summary, o1 represents a paradigm shift from "memorize the answers" to "memorize the reasoning" but is not a departure from the broader paradigm of fitting a curve to a distribution in order to boost performance by making everything in-distribution

Mike Knoop (@mikeknoop) 's Twitter Profile Photo

ARC Prize update: when we launched ARC Prize I said if it went unbeaten for 3 months, we’d increase the prize. Today is that day! We want to keep growing the prize.

François Chollet (@fchollet) 's Twitter Profile Photo

Some people believe that ARC-AGI was designed specifically to trip up LLMs. In reality I settled on the benchmark format and made the first few tasks in *spring 2018* -- long before even GPT-2 -- and I released the full dataset in fall 2019. ARC-AGI isn't about LLMs. It's about