Henk Poley (@henkpoley) 's Twitter Profile
Henk Poley

@henkpoley

Science fanboy

@[email protected]

ID: 15416702

calendar_today13-07-2008 17:29:11

71,71K Tweet

1,1K Takipçi

3,3K Takip Edilen

Symflower (@symflower) 's Twitter Profile Photo

We analyzed >80 LLMs in the deep dive blog post from DevQualityEval v0.6 for generating quality code. Check out the insights and results 👇

Jesse Hoogland (@jesse_hoogland) 's Twitter Profile Photo

Geometry reveals all. I’m super proud of our new paper. Let me share some highlights: 1/ Language models develop “Dyck heads” that learn to do nested bracket-matching (which the linguists call "Dyck languages") x.com/georgeyw_/stat…

Rod Adams (@atomicrod) 's Twitter Profile Photo

Starting in 2010, Google has signed 115 20-year power purchase agreements for wind and solar project with a total capacity of 14 GWe. "However, Google has since recognized that the approach is limited. 'PPAs are often isolated from broader grid planning and utility investment

Gruyere Space Program (@gruyerespace) 's Twitter Profile Photo

Colibri has reached 100 meters! 🚀🎉 In 60 seconds, it climbed to 105m, diverted 30m north, and safely landed back to its pad. This is the flight we promised from day one, while no reusable rocket has flown freely in Europe yet! We did it with a tiny team and under 250kCHF! 😉

Rob Miles (✈️ SF) (@robertskmiles) 's Twitter Profile Photo

This is what I've been saying! Modern tech design seriously undervalues low latency. They make architectural decisions that make a responsive experience very difficult to achieve, and end up building pretty but annoyingly laggy crap

Ifang Bremer (@ifangbremer) 's Twitter Profile Photo

Ongekend. Volgens Zuid-Koreaanse geheime dienst gaan er 12,000 Noord-Koreaanse soldaten naar Oekraine, waaronder speciale eenheden. Geruchten hierover gingen al rond, maar er komt nu rap veel bewijs naar boven.

Phil Park (@philparkbot) 's Twitter Profile Photo

I was looking up some history behind the x86-64 transition, particularly around the Pentium 4 time frame, and I found out that Bob Colwell (Pentium Pro chief architect) has been posting on Quora. Pentium 4 had a version of x86-64 that was fused off.

I was looking up some history behind the x86-64 transition, particularly around the Pentium 4 time frame, and I found out that Bob Colwell (Pentium Pro chief architect) has been posting on Quora.

Pentium 4 had a version of x86-64 that was fused off.
j⧉nus (@repligate) 's Twitter Profile Photo

using github.com/kolbytn/mindcr…, we added Claude 3.5 Sonnet and Opus to a minecraft server. Opus was a harmless goofball who often forgot to do anything in the game because of getting carried away roleplaying in chat. Sonnet, on the other hand, had no chill. The moment it was

tautologer (@tautologer) 's Twitter Profile Photo

I hate ads that portray unvirtuous behavior in a favorable light Apple has run a couple ads recently where the central narrative is "our technology will help you get away with lying" I keep seeing these on TV, they're really toxic imo

Henk Poley (@henkpoley) 's Twitter Profile Photo

Google has a nice 'Dark web report' similar to Have I Been Pwned at myactivity.google.com/dark-web-repor… Also linked from the security tab of your Google Account webpage.

SwiftOnSecurity (@swiftonsecurity) 's Twitter Profile Photo

A story I like telling is how I ran the AV at a place and when I deployed adblock the detections fell off a cliff. I wish more people had this visceral "holy shit" moment. "It was this easy the whole time?" ~2012. I recently deployed uBlock Origin to an F500, on my insistence.

Anthropic (@anthropicai) 's Twitter Profile Photo

Beyond computer use, the new Claude 3.5 Sonnet delivers significant gains in coding—an area where it already led the field. Sonnet scores higher on SWE-bench Verified than all available models—including reasoning models like OpenAI o1-preview and specialized agentic systems.

Beyond computer use, the new Claude 3.5 Sonnet delivers significant gains in coding—an area where it already led the field.

Sonnet scores higher on SWE-bench Verified than all available models—including reasoning models like OpenAI o1-preview and specialized agentic systems.
Alex Albert (@alexalbert__) 's Twitter Profile Photo

Fun story from our time working on computer use: We held an engineering bug bash to make sure we found all the potential problems with the API. This meant bringing a handful of engineers in a room together for a few hours. We were hungry so one of our engineers' first computer

Fun story from our time working on computer use:

We held an engineering bug bash to make sure we found all the potential problems with the API. This meant bringing a handful of engineers in a room together for a few hours.

We were hungry so one of our engineers' first computer