Benj Edwards (@benjedwards)'s Twitter Profile
Benj Edwards

@benjedwards

Senior AI Reporter, Ars Technica. Tech Historian. Fast Company / The Atlantic / Retronauts. Creator of Vintagecomputing.com and The Culture of Tech

ID: 19948217

Link: http://www.benjedwards.com · Joined: 03-02-2009 02:08:17

59.59K Tweets

12.12K Followers

1.1K Following

Sophie (@sophieamandah):

When my partner mentions the dishes I've left on the side, I stare into the distance, channel Galadriel, and say 'That is no ordinary mess. That is the work of Sauron.' Left my washing in the tumble dryer? Sauron. Pile of books on the floor? Sauron's evil knows no bounds...

David Hogg 🟧 (@davidhogg111):

I’ve learned a lot helping my family get my dad into hospice, but one of the biggest realizations I’ve had is that our country is facing a geriatric financial time bomb. It is absolutely insane how much elder care costs. For my family, if we were to have someone 24/7, it’s $450k a

Simon Willison (@simonw):

Anyone got a good example of a “reasoning” prompt that fails in GPT-4o but succeeds in the newly launched o1? openai.com/o1/

Benj Edwards (@benjedwards):

OpenAI's awkward "o1" AI model branding is kinda strange. "Strawberry" was right there, already christened and used by people to describe it for months.

Noam Brown (@polynoamial):

Believe it or not, the name Strawberry does not come from the “How many r’s are in strawberry” meme. We just chose a random word. As far as we know it was a complete coincidence.

MMitchell (@mmitchell_ai):

YAY! The CEO of OpenAI just recognized that LLMs generate text-based tokens using randomness and probability! Something objectively true that people have oddly made controversial! Check out our paper on this that introduced the term. dl.acm.org/doi/10.1145/34… It's a good day. 🤗🦜

Benj Edwards (@benjedwards):

Last November, several news outlets called OpenAI's new o1 model (then named Q*) a "powerful AI discovery" that insiders said "could threaten humanity." Now that o1-preview is out, I would love to hear if anyone thinks that is the case.

Benj Edwards (@benjedwards):

OpenAI's o1-preview does pretty well on my "magenta" test. But the first LLM that just answers "no" without any qualifications will probably be AGI. 😁 Reading its internal reasoning can be pretty amusing.

Ethan Mollick (@emollick):

OpenAI’s o1 is also the first specialized frontier model available widely. It doesn’t do everything better than GPT-4o, but it does a few classes of things a lot better. Unless you are doing problems that benefit from planning a solution, you may not see improvement.

Benj Edwards (@benjedwards):

It occurred to me today that I am probably not thinking deeply enough to test o1 properly. 😁 I have fed it various tasks, and no, it's not perfect with every result, but it's clear to me that putting its logical abilities to the test will require a rethink.

Alison Fisk (@alisonfisk):

Some things never change! A 2,000-year-old Roman souvenir pen with a joke inscription roughly equivalent to: "I went to Rome and all I got you was this cheap pen!" 😂 Dated c. 70 AD, the stylus pen was found in London during excavations by MOLA. The Roman inscription reads:
