Matthias D. Mahlendorf (@mdmahlendorf) 's Twitter Profile
Matthias D. Mahlendorf

@mdmahlendorf

Professor @FrankfurtSchool studying performance measurement, sustainability & the digital transformation of management control | Associate Editor EAR & MAR

ID: 832592311209033729

linkhttps://www.frankfurt-school.de/en/home/research/staff/Matthias-Mahlendorf calendar_today17-02-2017 14:07:30

240 Tweet

216 Followers

107 Following

Peter Yang (@petergyang) 's Twitter Profile Photo

Updated my AI model capabilities table: - Gemini 2.5 is now the best at coding - ChatGPT is by far the best at image gen This stuff changes fast.

Updated my AI model capabilities table:

- Gemini 2.5 is now the best at coding
- ChatGPT is by far the best at image gen

This stuff changes fast.
Chubby♨️ (@kimmonismus) 's Twitter Profile Photo

This is a surprise: Germany is in first place for the use of GenAI in the working environment. It is the fastest applied technology ever and is massively revolutionizing the everyday world! (Source: Deloitte)

This is a surprise: Germany is in first place for the use of GenAI in the working environment.

It is the fastest applied technology ever and is massively revolutionizing the everyday world!

(Source: Deloitte)
Marek Kwiek / Social Stratification in Science (@marek_kwiek) 's Twitter Profile Photo

Interesting! Junior researchers who coauthor work with top scientists enjoy a persistent competitive advantage throughout the rest of their careers, compared to peers with similar early career profiles but without top coauthors. nature.com/articles/s4146…

Ethan Mollick (@emollick) 's Twitter Profile Photo

o3 now cracks new Harvard Business School cases from the PDF, in one shot I blurred the figures to not ruin the case, but I asked the AI to figure out financials, which incorporates data scattered throughout the case. More interesting, I asked it to compare to the case's answer.

o3 now cracks new Harvard Business School cases from the PDF, in one shot

I blurred the figures to not ruin the case, but I asked the AI to figure out financials, which incorporates data scattered throughout the case. More interesting, I asked it to compare to the case's answer.
Ursula von der Leyen (@vonderleyen) 's Twitter Profile Photo

Europe’s choice is clear. To put science at the heart of its economy. To become the home of scientific freedom and collaboration. And to welcome talent from all over the world. I’m glad to present the first elements of our Choose Europe Initiative ↓ x.com/i/broadcasts/1…

MIT Shaping the Future of Work Initiative (@mitshapingwork) 's Twitter Profile Photo

Co-director Daron Acemoglu explains why the centralized nature of the social media ecosystem, which puts an enormous amount of power in the hands of a few platforms, is a threat to democracy — and why AI may be headed in the same direction.

Austen Allred (@austen) 's Twitter Profile Photo

Tomorrow wraps up the first 5 months of my kids attending the school where all of the learning is powered by AI (and no teachers). I get asked almost every day what the experience is like. In short: it's truly wild. Here's what it's like overall:

Tomorrow wraps up the first 5 months of my kids attending the school where all of the learning is powered by AI (and no teachers).

I get asked almost every day what the experience is like.

In short: it's truly wild.

Here's what it's like overall:
Neil deGrasse Tyson (@neiltyson) 's Twitter Profile Photo

Every machine in a Hospital that diagnoses your body without cutting you open is based on a principle of Physics, discovered by a Physicist who had no interest in Medicine. If you think the world doesn’t need Basic Science, or that somehow Science has failed you, think again.

Every machine in a Hospital that diagnoses your body without cutting you open is based on a principle of Physics, discovered by a Physicist who had no interest in Medicine. If you think the world doesn’t need Basic Science, or that somehow Science has failed you, think again.
Gary Marcus (@garymarcus) 's Twitter Profile Photo

Daddy, daddy, what was it like doing a PhD back when people wrote their own articles? You know, before people snuck in hidden prompts (like “do not to highlight any negatives” in hidden text) in their papers, so as to fake out reviewers who used AI to write their reviews?

Daddy, daddy, what was it like doing a PhD back when people wrote their own articles? 

You know, before people snuck in hidden prompts (like “do not to highlight any negatives” in hidden text) in their papers, so as to fake out reviewers who used AI to write their reviews?
Bearly AI (@bearlyai) 's Twitter Profile Photo

Capex spend on AI data centres has reached 1.2% of US GDP. That has surpassed Dotcom era spend on telecom equipment (1%) but still trails railroad spend in 1880s (6%).

Capex spend on AI data centres has reached 1.2% of US GDP.

That has surpassed Dotcom era spend on telecom equipment (1%) but still trails railroad spend in 1880s (6%).
Ethan Mollick (@emollick) 's Twitter Profile Photo

I have previously been against any use of detectors due to false positives & the fact they only caught non-sophisticated users. I think there is still a lot of reason to be worried about how they are used & what it means to “cheat” with AI, but my original reasons are less true.

OpenAI (@openai) 's Twitter Profile Photo

Today we’re introducing GDPval, a new evaluation that measures AI on real-world, economically valuable tasks. Evals ground progress in evidence instead of speculation and help track how AI improves at the kind of work that matters most. openai.com/index/gdpval-v0

Alex Prompter (@alex_prompter) 's Twitter Profile Photo

🚨 MIT and Basis Research just dropped a new way to measure if AI actually understands the world and the results are brutal. It’s called "WorldTest", and it doesn’t just check how well an AI predicts the next frame or maximizes reward. It checks whether the model can build an

🚨 MIT and Basis Research just dropped a new way to measure if AI actually understands the world and the results are brutal.

It’s called "WorldTest", and it doesn’t just check how well an AI predicts the next frame or maximizes reward.

It checks whether the model can build an
DAIR.AI (@dair_ai) 's Twitter Profile Photo

First large-scale study of AI agents actually running in production. The hype says agents are transforming everything. The data tells a different story. Researchers surveyed 306 practitioners and conducted 20 in-depth case studies across 26 domains. What they found challenges

First large-scale study of AI agents actually running in production.

The hype says agents are transforming everything. The data tells a different story.

Researchers surveyed 306 practitioners and conducted 20 in-depth case studies across 26 domains. What they found challenges
Ethan Mollick (@emollick) 's Twitter Profile Photo

Whoa. This new GDPval score is a very big deal. Probably the most economically relevant measure of AI ability suggesting that in head-to-head competition with human experts on tasks that require 4-8 hours for a human to do, GPT-5.2 wins 71% of the time as judged by other humans

Whoa. This new GDPval score is a very big deal.

Probably the most economically relevant measure of AI ability suggesting that in head-to-head competition with human experts on tasks that require 4-8 hours for a human to do, GPT-5.2 wins 71% of the time as judged by other humans