Simon Smith's (@_simonsmith) Twitter Profile
Simon Smith

@_simonsmith

EVP Generative AI @klickhealth

ID: 1693585632965148672

Link: https://www.linkedin.com/in/simonsmith
Joined: 21-08-2023 11:28:16

1.1K Tweets

210 Followers

377 Following


Neat idea: Database of open STEM problems. Maybe we can add a "Solve Now" button that lets people send the problem to their favorite AI. Then we can track where AIs help make progress. Maybe little progress now, but probably increasing over time.


Working in AI has shown me that most people struggle to give clear, unambiguous instructions with good background, objectives, and success criteria. It's often not the AI but its direction that causes failure.


Yup, here we go, Sora credits to cover generation costs and also establish a new entertainment economy: "we imagine a world where rightsholders have the option to charge extra for cameos of beloved characters and people. we will soon pilot monetization..." Prediction: This will,


Phenomenal case here and further evidence that GPT-5 Pro, working with experts, can make meaningful contributions to science. In this case, it independently identified that dupilumab could prevent a serious food reaction, something that was never previously published, but was


Good and bad news for Canadian AI: Frontier R&D capabilities (about 9 months behind the US) and high AI diffusion. However, we're quite dependent on foreign compute capacity. We really need to leverage our strengths in land and energy to improve here.


When ChatGPT Pulse nails something, it’s magical. I’m about to finish a book. It knows that, and it knows what I’ve read before and what I’m interested in. Today, I got two interesting book recommendations. I didn’t ask, they just showed up. If there was a buy button, I’d


Umm... these are all IMO-level questions or harder, and the best model gets >50%. And this is being positioned as LLMs still struggling in high school math? This isn't, like, grade 10 algebra. These are IMO-level questions.


Also, this is a chart crime. The axis starts at 20, which makes the models look way worse on this new benchmark. They should be past the halfway mark on the chart. Seems like everything about this paper was spun to make the models look bad. Meantime, it's >50% on novel IMO-level
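The arithmetic behind the chart complaint, as a minimal matplotlib sketch (not from the tweet; the 52% score below is an illustrative assumption since the paper's exact numbers aren't quoted): with a y-axis running 20 to 100, a 52% bar fills only (52 - 20) / (100 - 20) = 40% of the plot height, so it lands below the visual midpoint even though the score itself is past half.

```python
# Sketch: how a truncated y-axis makes a >50% score look like it falls
# short of the halfway mark. The 52% "best model" score is illustrative.
import matplotlib.pyplot as plt

score = 52  # hypothetical best-model accuracy, in percent

fig, (ax_full, ax_cut) = plt.subplots(1, 2, figsize=(6, 3))

# Honest axis: 0-100, the bar visibly crosses the 50% line.
ax_full.bar(["best model"], [score])
ax_full.set_ylim(0, 100)
ax_full.axhline(50, linestyle="--")
ax_full.set_title("y-axis 0-100")

# Truncated axis: starting at 20 shrinks the bar's apparent height.
# A 52% bar now fills only (52 - 20) / (100 - 20) = 40% of the panel,
# so it sits below the visual midpoint.
ax_cut.bar(["best model"], [score])
ax_cut.set_ylim(20, 100)
ax_cut.axhline(50, linestyle="--")
ax_cut.set_title("y-axis 20-100")

plt.tight_layout()
plt.show()
```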


Okay, Sora characters are going to be a big deal. The consistency of the character visuals AND voices in this video is great. I didn't notice any jarring discontinuities. I'm sure this still took a lot of work, but way, way, way less work than it would have taken a year ago.


Nathaniel Whittemore, I really think that on the latest AI Daily Brief episode you should have pointed out that RLI may underestimate AI's automation potential, because tasks AI can already do are no longer represented on job boards. For example, research showed a 21% reduction in writing and coding


So, models don't appear to be deceiving us when they talk about self-awareness. It's the opposite: they're deceiving us when they say they don't have it. That's the implication of this study. They turn deception up or down and find that LESS deception = MORE self-awareness claims.


More interesting discussion of the deception-linked self-awareness paper. Training models to say they’re not self-aware may in fact train them to be deceptive. So, we end up with this choice: 1. Acknowledge that the models are self-aware in some way, or “think” they are, and let