Mike Knoop (@mikeknoop)'s Twitter Profile
Mike Knoop

@mikeknoop

co-founder @ndea and @zapier @arcprize

ID: 57444441

https://mikeknoop.com | Joined 16-07-2009 20:57:15

3.3K Tweets

20.2K Followers

332 Following

Mike Knoop (@mikeknoop)

i read the AlphaEvolve paper over the weekend. my analysis below! key question: how important is it? there are two dominant architectural substrates of frontier AI systems: neural networks (NN) and symbolic programs (SP). they exist at opposite ends of a spectrum with very

Wade Foster (@wadefoster)

27% of @Remote’s IT tickets are solved automatically. Using an AI-powered IT help desk built in Zapier. (It’s saving the team $500k a year in hiring costs 👀)

Mike Knoop (@mikeknoop)

ARC v2 paper is officially out! we tested v2 in a controlled setting with over 400 humans, this report contains details and analysis to substantiate our relative "easy for humans, hard for AI" claim. we'll be releasing the raw data later this week.

Mike Knoop (@mikeknoop)

Here's a paper with very interesting research on the security of LLM text embeddings. Authors claim they can reverse embeddings to text with no access to the original model and only a few ~thousand samples arxiv.org/abs/2505.12540

Mike Knoop (@mikeknoop)

And now available via Anthropic API -- today Anthropic announced the Zapier MCP connector support: anthropic.com/news/agent-cap…

Mike Knoop (@mikeknoop)

IMO this is broadly a good take. I'd soften "expert" as I think it will be difficult to deny AGI once AI can do all tasks humans find relatively easy. Today it remains easy to find such tasks.

Mike Knoop (@mikeknoop)

I've been looking forward to hearing about John's work for years. He was one of the first high-profile people to bet on alternative ideas to scaling up LLMs. New ideas are still needed for AGI. Highly recommend checking this out.

Mike Knoop (@mikeknoop)

Claude Sonnet 4 ARC results. One broadly useful note for devs: within the synchronous API test-time compute (TTC) limits of most TTC systems, accuracy vs. cost is ~linear. Providers have picked their sync TTC points well; the exponential cost regime requires a streaming API.

Mike Knoop (@mikeknoop)

Most founders think great SEO is about hygiene: title tags, keywords, back links, etc. These are only table stakes to get and stay indexed. Great SEO is actually more like user research. In fact, Google is basically an online RL algorithm doing user research. Users have a broad

Mike Knoop (@mikeknoop)

Thanks to friends at Anthropic we've now got Claude 4 Opus results on ARC! The most interesting finding is semi-private v2 scoring ~9% (new commercial SOTA) while v1 is only at 35%.

Mike Knoop (@mikeknoop)

True for most automation scenarios. Given machine intelligence on par with human capability for a task, machines offer lower variability and will be preferred (even commanding a premium on price).

Mike Knoop (@mikeknoop)

Hard thing about "AI SEO" is the O. There is no feedback loop to optimize, unlike classic search which has indexing and bounce rate loops. For niche/long-tail/novel content, AI falls back to classic search anyway. For now you're better off using AI to win classic SEO.