Sergio Sánchez (@s3rgiosanchez) 's Twitter Profile
Sergio Sánchez

@s3rgiosanchez

Director de la Oficina de Innovación en Emais.
Software Engineer.
PMP® & Scrum Master certified.
Opinions are my own.

ID: 82100986

linkhttps://www.linkedin.com/in/sergiosanchezalvarez/ calendar_today13-10-2009 14:00:31

2,2K Tweet

296 Followers

939 Following

Anthropic (@anthropicai) 's Twitter Profile Photo

New Anthropic research: Estimating AI productivity gains from Claude conversations. The Anthropic Economic Index tells us where Claude is used, and for which tasks. But it doesn’t tell us how useful Claude is. How much time does it save?

New Anthropic research: Estimating AI productivity gains from Claude conversations.

The Anthropic Economic Index tells us where Claude is used, and for which tasks. But it doesn’t tell us how useful Claude is. How much time does it save?
Runway (@runwayml) 's Twitter Profile Photo

Introducing our new frontier video model, Runway Gen-4.5. Previously known as Whisper Thunder (aka) David. Gen-4.5 is state-of-the-art and sets a new standard for video generation motion quality, prompt adherence and visual fidelity. Learn more below.

Poetiq (@poetiq_ai) 's Twitter Profile Photo

Poetiq has officially shattered the ARC-AGI-2 SOTA 🚀 ARC Prize has officially verified our results: - 54% Accuracy – first to break the 50% barrier! - $30.57 / problem – less than half the cost of the previous best! We are now #1 on the leaderboard for ARC-AGI-2!

Poetiq has officially shattered the ARC-AGI-2 SOTA 🚀

<a href="/arcprize/">ARC Prize</a> has officially verified our results:
- 54% Accuracy – first to break the 50% barrier!
- $30.57 / problem – less than half the cost of the previous best!

We are now #1 on the leaderboard for ARC-AGI-2!
Mckay Wrigley (@mckaywrigley) 's Twitter Profile Photo

Here are my Opus 4.5 thoughts after ~2 weeks of use. First some general thoughts, then some practical stuff. --- THE BIG PICTURE --- THE UNLOCK FOR AGENTS It's clear to anyone who's used Opus 4.5 that AI progress isn't slowing down. I'm surprised more people aren't treating

Simón Muñoz (@simonvlc) 's Twitter Profile Photo

"La elección ya no es si adoptar IA. Ese tren ya partió. La elección es si estar entre el 5% que está construyendo el futuro o el 95% que está intentando entenderlo." estrategiadeproducto.com/p/el-futuro-de…

OpenAI Newsroom (@openainewsroom) 's Twitter Profile Photo

OpenAI is co-founding the Agentic AI Foundation (AAIF) under the Linux Foundation alongside Anthropic and Block to support open, interoperable standards for agentic AI. We’re also donating AGENTS .md to help establish open standards that enable safe, reliable agents across

Matt Shumer (@mattshumer_) 's Twitter Profile Photo

I've had access to GPT-5.2 since November 25th. Since then, I've used it as my daily-driver, pushing it to its limits. It beats out Opus 4.5 in most things I tried, but there's a (big) catch. Here's my review of GPT-5.2: shumer.dev/gpt52review

ARC Prize (@arcprize) 's Twitter Profile Photo

A year ago, we verified a preview of an unreleased version of OpenAI o3 (High) that scored 88% on ARC-AGI-1 at est. $4.5k/task Today, we’ve verified a new GPT-5.2 Pro (X-High) SOTA score of 90.5% at $11.64/task This represents a ~390X efficiency improvement in one year

A year ago, we verified a preview of an unreleased version of <a href="/OpenAI/">OpenAI</a> o3 (High) that scored 88% on ARC-AGI-1 at est. $4.5k/task

Today, we’ve verified a new GPT-5.2 Pro (X-High) SOTA score of 90.5% at $11.64/task

This represents a ~390X efficiency improvement in one year
ARC Prize (@arcprize) 's Twitter Profile Photo

ARC-AGI-3 (2026) will drive AI capability and efficiency even further Designed to measure the ability of AI to efficiently learn and generalize in novel environments, it will be a first-of-its-kind Interactive Reasoning Benchmark Stay tuned

Javi López ⛩️ (@javilop) 's Twitter Profile Photo

🔥 ¡OPEN AI ESTÁ DE VUELTA! Sam Altman acaba de soltar los resultados de GPT-5.2 Thinking y es un auténtico monstruo. Altman lo llama un modelo MUY inteligente, y los benchmarks lo avalan → salto brutal sobre GPT-5.1, y se fuma a Claude Opus 4.5 y Gemini 3 Pro en pruebas

🔥 ¡OPEN AI ESTÁ DE VUELTA!

Sam Altman acaba de soltar los resultados de GPT-5.2 Thinking y es un auténtico monstruo.

Altman lo llama un modelo MUY inteligente, y los benchmarks lo avalan → salto brutal sobre GPT-5.1, y se fuma a Claude Opus 4.5 y Gemini 3 Pro en pruebas
Sergio Sánchez (@s3rgiosanchez) 's Twitter Profile Photo

Estos casos de uso los iremos viendo más y más conforme la adopción del vibe coding de los nuevos modelos vaya calando en los equipos de desarrollo. Una migración completa en 3 días, increíble.

Tibo (@thsottiaux) 's Twitter Profile Photo

GPT-5.2-Codex is out, further advancing our SoTA for professional software engineering and long-running agentic coding work. It improves on instruction following, long-context understanding, and pushes the frontier including on cyber. $ codex -m gpt-5.2-codex

GPT-5.2-Codex is out, further advancing our SoTA for professional software engineering and long-running agentic coding work.

It improves on instruction following, long-context understanding, and pushes the frontier including on cyber. 

$ codex -m gpt-5.2-codex
OpenAI Developers (@openaidevs) 's Twitter Profile Photo

🆕 Codex now officially supports skills Skills are reusable bundles of instructions, scripts, and resources that help Codex complete specific tasks. You can call a skill directly with $.skill-name, or let Codex choose the right one based on your prompt.

OpenAI Developers (@openaidevs) 's Twitter Profile Photo

📣 How we built the Codex agent loop Ever wonder what Codex does between your prompt and its response? Each turn assembles inputs, runs inference, executes tools, and feeds the results back into context until the loop ends openai.com/index/unrollin…

Vaibhav (VB) Srivastav (@reach_vb) 's Twitter Profile Photo

BOOOOM! Introducing GPT-5.3-Codex: our most capable agentic coding model yet 🔥 > Frontier coding + terminal skills with fewer tokens > Built for long-running tasks (research → tool use → execution) > Interactive mid-turn steering + frequent progress updates > Stronger default

BOOOOM! Introducing GPT-5.3-Codex: our most capable agentic coding model yet 🔥

&gt; Frontier coding + terminal skills with fewer tokens
&gt; Built for long-running tasks (research → tool use → execution)
&gt; Interactive mid-turn steering + frequent progress updates
&gt; Stronger default