Jason D. Clinton 🔸 (@jasondclinton) 's Twitter Profile
Jason D. Clinton 🔸

@jasondclinton

CISO at Anthropic. Ex-Google Chrome. My views are not those of my employer.

ID: 1445543894586834954

linkhttps://www.jasonclinton.com/ calendar_today06-10-2021 00:18:39

704 Tweet

2,2K Takipçi

226 Takip Edilen

Augment Code (@augmentcode) 's Twitter Profile Photo

Claude Sonnet 4.5 is now the default model for Augment Code. We’re rolling it out to all customers over the next 24 hours, where it will be available alongside Sonnet 4 (for a limited time) and GPT-5 in the model picker. Here’s how Sonnet 4.5 compares to Sonnet 4:

Dan Shipper 📧 (@danshipper) 's Twitter Profile Photo

BREAKING: Anthropic just dropped Claude Sonnet 4.5! We've been testing it for a few days Every 📧 and here's what we found: - It's smarter and faster than Opus: It solved a nasty bug for Kieran Klaassen than Opus 4.1 was continually failing at. And it feels twice as fast. - It's

BREAKING:

Anthropic just dropped Claude Sonnet 4.5! We've been testing it for a few days <a href="/every/">Every 📧</a> and here's what we found:

- It's smarter and faster than Opus: It solved a nasty bug for <a href="/kieranklaassen/">Kieran Klaassen</a> than Opus 4.1 was continually failing at. And it feels twice as fast.

- It's
Dylan Field (@zoink) 's Twitter Profile Photo

Spent the last few days exploring the capabilities of Claude Sonnet 4.5 in Figma Make! It's a very impressive model and we are starting a staged rollout to users immediately. Some thoughts in video below — recorded late last night!

Ethan Mollick (@emollick) 's Twitter Profile Photo

I had some early access to Sonnet 4.5. It is a really good model. I saw especially big jumps in doing finance and statistics, which tend to get overlooked in the focus on coding.

Notion (@notionhq) 's Twitter Profile Photo

Shipping today: Anthropic's Sonnet 4.5 in Notion. Better reasoning, smarter planning, and improved understanding that makes your personal Agent feel truly personalized. Built to get work done.

Axel Darmouni (@adarmouni) 's Twitter Profile Photo

Claiming again how insane it is UI-Tars 2 from ByteDance, which was SoTA in CU before that, is at *47.5%*, which is FOURTEEN points below Sonnet 4.5 This model trumps everything in Computer Use and it’s not even close, Anthropic WILL advance and remain SoTA there

Claiming again how insane it is

UI-Tars 2 from ByteDance, which was SoTA in CU before that, is at *47.5%*, which is FOURTEEN points below Sonnet 4.5

This model trumps everything in Computer Use and it’s not even close, Anthropic WILL advance and remain SoTA there
Factory (@factoryai) 's Twitter Profile Photo

Sonnet 4.5, meet Droid. After joint testing with Anthropic, we find the strengths of Sonnet 4.5 to be: • Significantly more reliable and accurate file editing • High environmental awareness • Snappier than previous models on quick questions, not overthinking simple asks

Rachel Wu (@jupiterana93211) 's Twitter Profile Photo

Claude Sonnet 4.5 is here and I got to try it for about 2 days. I would say It's the strongest model for building complex agents. Multi-agents system worked for the first time for me . Let's see what crazy agents people come up with.

Claude Sonnet 4.5 is here and I got to try it for about 2 days. 

I would say It's the strongest model for building complex agents. Multi-agents system worked for the first time  for me . 

Let's see what crazy agents people come up with.
Simon Willison (@simonw) 's Twitter Profile Photo

Wrote up my initial impressions of the brand new Claude Sonnet 4.5 - I think it may live up to Anthropic's claims of being the "best coding model in the world", for the next few weeks at least! simonwillison.net/2025/Sep/29/cl…

Jason D. Clinton 🔸 (@jasondclinton) 's Twitter Profile Photo

Huge amount of effort went into augmenting 4.5 for cyber use with our partners. We think that 4.5 is the best model for cyber defense now.

Jack Lindsey (@jack_w_lindsey) 's Twitter Profile Photo

Prior to the release of Claude Sonnet 4.5, we conducted a white-box audit of the model, applying interpretability techniques to “read the model’s mind” in order to validate its reliability and alignment. This was the first such audit on a frontier LLM, to our knowledge. (1/15)

Prior to the release of Claude Sonnet 4.5, we conducted a white-box audit of the model, applying interpretability techniques to “read the model’s mind” in order to validate its reliability and alignment. This was the first such audit on a frontier LLM, to our knowledge. (1/15)
Ethan Mollick (@emollick) 's Twitter Profile Photo

AI agents are now capable of doing real, if bounded, work. But that work can be very valuable. For example, the new Claude Sonnet 4.5 was able to replicate published economics research from data files & the paper. We need to figure out what to do with it: oneusefulthing.org/p/real-ai-agen…

Sam Paech (@sam_paech) 's Twitter Profile Photo

Sonnet 4.5 tops both writing evals! On spiral-bench: much stronger pushback & de-escalation vs sonnet-4. GLM-4.6's score is incremental over GLM-4.5 - but personally I like the newer version's writing much better. Links & writing samples ->

Sonnet 4.5 tops both writing evals!

On spiral-bench: much stronger pushback &amp; de-escalation vs sonnet-4.

GLM-4.6's score is incremental over GLM-4.5 - but personally I like the newer version's writing much better.

Links &amp; writing samples -&gt;