Evan Mays (@evanon0ping) Twitter Tweets • TwiCopy

Evan Mays

@evanon0ping

9 months ago

This kind of stuff seems inevitable

thumb_up_off_alt4

chat_bubble_outline0

repeat1

shareShare

Evan Mays

@evanon0ping

9 months ago

People over fixate on SWE bench scores, every lab runs a different subset of the benchmark so no one really knows which model is really SOTA Try new models on your own real world tasks

thumb_up_off_alt18

chat_bubble_outline1

repeat1

shareShare

Aidan McLaughlin

@aidan_mclau

9 months ago

guys, ai progress just isn't slowing down gpt-5 completes tasks that take 52% longer trust the exponential (so i don't chartcrime, full plot linked below)

thumb_up_off_alt1,1K

chat_bubble_outline87

repeat79

shareShare

Stephen McAleer

@mcaleerstephen

9 months ago

We've entered a new phase where progress in chatbots is starting to top out but progress in automating AI research is steadily improving. It's a mistake the confuse the two.

thumb_up_off_alt251

chat_bubble_outline16

repeat22

shareShare

Evan Mays

@evanon0ping

8 months ago

Data centers on the moon are our generation’s flag on the moon

thumb_up_off_alt4

chat_bubble_outline0

repeat0

shareShare

Evan Mays

@evanon0ping

8 months ago

thumb_up_off_alt8

chat_bubble_outline1

repeat0

shareShare

Evan Mays

@evanon0ping

8 months ago

PSA: check your surroundings before opening the Waymo door to get out Mine parked next to a pole today and the door door hit the pole

thumb_up_off_alt3

chat_bubble_outline0

repeat0

shareShare

Evan Mays

@evanon0ping

7 months ago

A day with the homies will fix your HRV

thumb_up_off_alt4

chat_bubble_outline0

repeat0

shareShare

Evan Mays

@evanon0ping

7 months ago

Ani greeting me as I wake up from my hibernation pod in the year 3035

thumb_up_off_alt4

chat_bubble_outline1

repeat0

shareShare

Evan Mays

@evanon0ping

7 months ago

Calling it now: All these SaaS companies that charge per seat are going to lobby for Agents to have rights in 2030

thumb_up_off_alt6

chat_bubble_outline0

repeat0

shareShare

x.com/Smol_AI/status… paper doesn't mention "AGI" but if you consider that we used to define AGI as "outperform humans at most economically valuable work” then surely GDPVal is the most direct AGI benchmark we have ever had and we are between 77-95% of the way there and should

thumb_up_off_alt79

chat_bubble_outline2

repeat3

shareShare

Evan Mays

@evanon0ping

7 months ago

so proud of openai for this you cant make an honest AI if you aren't honest with yourself

thumb_up_off_alt44

chat_bubble_outline1

repeat3

shareShare

John Kim

@john_sungjin

7 months ago

Today, we're launching Village to let investors orchestrate teams of agents to scale their judgment. Three years ago, we bet that both humans and agents would need new tools to fulfill the promise of LLMs to transform research. Village is that tool, the first IDE for research

thumb_up_off_alt112

chat_bubble_outline19

repeat22

shareShare

Evan Mays

@evanon0ping

7 months ago

Should robots run or bike when commuting

thumb_up_off_alt3

chat_bubble_outline3

repeat0

shareShare

Olivia Li

@oliviali_

7 months ago

We’ve been cooking up some pretty cool tech. If you’re an engineer/scientist who wants to work on quite literally world changing work, we’re hiring

thumb_up_off_alt175

chat_bubble_outline5

repeat3

shareShare

Evan Mays

@evanon0ping

6 months ago

Most open source releases are overhyped, we should doubt any lab’s staying power unless they consistently ship bleeding edge models

thumb_up_off_alt9

chat_bubble_outline1

repeat0

shareShare

dtjourney

@dtjourney

6 months ago

ZINGAGE IS IN PRODUCTION HELL. WE ARE HIRING 10 ENGINEERS NOW.

thumb_up_off_alt151

chat_bubble_outline4

repeat28

shareShare

Evan Mays

@evanon0ping

6 months ago

I ❤️ nvidia

thumb_up_off_alt11

chat_bubble_outline1

repeat0

shareShare

Rhythm Garg

@rhythmrg

6 months ago

Excited to share what Yash Patil, Linden Li, and I have been up to since OpenAI: Applied Compute Companies like Cognition, DoorDash, and Mercor have already captured the initial gains from generalist models. They’re now pulling ahead with Specific Intelligence: custom

thumb_up_off_alt110

chat_bubble_outline12

repeat10

shareShare

Evan Mays

@evanon0ping

6 months ago

In space no one can hear your datacenter

thumb_up_off_alt3

chat_bubble_outline0

repeat0

shareShare

Evan Mays

Evan Mays

Evan Mays

Aidan McLaughlin

Stephen McAleer

Evan Mays

Evan Mays

Evan Mays

Evan Mays

Evan Mays

Evan Mays

swyx

Evan Mays

John Kim

Evan Mays

Olivia Li

Evan Mays

dtjourney

Evan Mays

Rhythm Garg

Evan Mays