Jack Cole (@mindsai_jack)'s Twitter Profile
Jack Cole

@mindsai_jack

AI Researcher, Clinical Psychologist, App Dev

ID: 1513590800155820038

Link: https://tufalabs.ai · Joined: 11-04-2022 19:04:12

838 Tweets

2.2K Followers

277 Following

Sundar Pichai (@sundarpichai)'s Twitter Profile Photo

1/ Gemini 2.5 is here, and it’s our most intelligent AI model ever. Our first 2.5 model, Gemini 2.5 Pro Experimental is a state-of-the-art thinking model, leading in a wide range of benchmarks – with impressive improvements in enhanced reasoning and coding and now #1 on

Anthropic (@anthropicai)'s Twitter Profile Photo

Last month we launched our Anthropic Economic Index, to help track the effect of AI on labor markets and the economy. Today, we’re releasing the second research report from the Index, and sharing several more datasets based on anonymized Claude usage data.

Machine Learning Street Talk (@mlstreettalk)'s Twitter Profile Photo

We are hiring full-time senior video editors and motion graphics artists in the UK to work in our offices near London. We are making some of the most interesting technical content on YouTube, it will be exciting! Hit me up or refer friends 🙏

Machine Learning Street Talk (@mlstreettalk)'s Twitter Profile Photo

AI beliefs & reward functions ≠ ours. While simple games (Go, Chess) have clear win/lose rewards, complex real-world situations don't. "Reward is all you need" is partly true, but where does the reward function come from? This is Dr. Jeff Beck from Noumenal Labs

Artem Kirsanov (@artemkrsv)'s Twitter Profile Photo

Why does linear regression minimize squared error? I always thought it was just computationally convenient compared to absolute values. But then I discovered it naturally emerges from maximum likelihood estimation with Gaussian noise assumptions 🤯 Same with regularization—L2/L1
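The equivalence described in the tweet can be sketched numerically (illustrative code, not from the thread; the synthetic data and parameter names are made up for the demo): maximizing a Gaussian likelihood over a line's slope and intercept recovers exactly the ordinary least-squares fit.

```python
import numpy as np
from scipy.optimize import minimize

# Under y = a*x + b + noise, noise ~ N(0, sigma^2), the negative log-likelihood
#   NLL(a, b, sigma) = (n/2) * log(2*pi*sigma^2) + sum((y - a*x - b)^2) / (2*sigma^2)
# depends on (a, b) only through the sum of squared residuals, so maximizing
# the likelihood over (a, b) is the same as minimizing squared error.
rng = np.random.default_rng(0)
x = np.linspace(0.0, 1.0, 50)
y = 2.0 * x + 1.0 + rng.normal(scale=0.1, size=x.size)

def nll(params):
    a, b, log_sigma = params  # log-parametrize sigma to keep it positive
    sigma2 = np.exp(2.0 * log_sigma)
    resid = y - (a * x + b)
    return 0.5 * x.size * np.log(2.0 * np.pi * sigma2) + np.sum(resid**2) / (2.0 * sigma2)

# Maximum-likelihood fit via generic numerical optimization.
a_mle, b_mle, _ = minimize(nll, x0=[0.0, 0.0, 0.0]).x

# Ordinary least squares for comparison.
a_ls, b_ls = np.polyfit(x, y, deg=1)

print(f"MLE: a={a_mle:.4f}, b={b_mle:.4f}; least squares: a={a_ls:.4f}, b={b_ls:.4f}")
```

The same argument extends to regularization: adding a Gaussian prior on the weights and taking the MAP estimate yields L2 regularization, while a Laplace prior yields L1.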

Greg Kamradt (@gregkamradt)'s Twitter Profile Photo

Bryan Landers made this sweet graphic for us. It's a public, internal tool we're hosting here: arcprize.org/2025-sota. Not guaranteed to be up to date (we're getting there though).

Machine Learning Street Talk (@mlstreettalk)'s Twitter Profile Photo

In a recent discussion, Prof. Kevin Ellis and Dr. Zenna Tavares of Basis explored pathways toward artificial intelligence capable of more human-like learning, focusing on acquiring abstract knowledge from limited experience through active interaction. 🧵👇

ARC Prize (@arcprize)'s Twitter Profile Photo

Isaac Liao has open-sourced his "ARC-AGI Without Pretraining" notebook on Kaggle. You can use it today and enter ARC Prize 2025. It currently scores 4.17% on ARC-AGI-2 (5th place). Amazing mid-year sharing and contribution. Thank you, Isaac.

ElevenLabs (@elevenlabsio)'s Twitter Profile Photo

Creating a Professional Voice Clone (PVC) of your own voice allows you to produce high-quality voiceovers that sound exactly like you. Today, our team shipped a brand new version of the PVC creation flow that makes it much easier to create a perfect clone of your voice.

OpenAI Developers (@openaidevs)'s Twitter Profile Photo

Meet Codex CLI—an open-source local coding agent that turns natural language into working code. Tell Codex CLI what to build, fix, or explain, then watch it bring your ideas to life.

Noam Brown (@polynoamial)'s Twitter Profile Photo

Our new OpenAI o3 and o4-mini models further confirm that scaling inference improves intelligence, and that scaling RL shifts up the whole compute vs. intelligence curve. There is still a lot of room to scale both of these further.

Mike Knoop (@mikeknoop)'s Twitter Profile Photo

Re-testing released o3 on ARC-AGI-1 will take a day or two. Because today's release is a materially different system, we are re-labeling our past reported results as "preview":

o3-preview (low): 75.7%, $200/task
o3-preview (high): 87.5%, $34.4k/task

Above uses o1 pro pricing.

Wes Roth (@wesrothmoney)'s Twitter Profile Photo

Google's new Quantization-Aware Training (QAT) models shrink Gemma 3's VRAM needs by up to 75%, without killing performance.

That means:

✅ Run Gemma 3 27B on a 3090 GPU
✅ Use 12B on a laptop
✅ Deploy 1B on... basically a potato

Open-source, Hugging Face ready,

xAI (@xai)'s Twitter Profile Photo

Let’s start with Grok 3 Mini.

When we set out to build a fast, affordable mini model, we knew it would be good but even we didn’t expect it to be this good. Some highlights:

- Grok 3 Mini tops the leaderboards on graduate-level STEM, math, and coding, outcompeting flagship