Ethan Mollick (@emollick) 's Twitter Profile
Ethan Mollick

@emollick

Professor @Wharton studying AI, innovation & startups. Democratizing education using tech
Book: a.co/d/4VguzZz
Substack: oneusefulthing.org

ID: 39125788

linkhttps://mgmt.wharton.upenn.edu/profile/emollick/ calendar_today10-05-2009 22:33:52

31,31K Tweet

266,266K Takipçi

568 Takip Edilen

Ethan Mollick (@emollick) 's Twitter Profile Photo

When reading AI benchmarks, aside from the fact that many of the AIs are (accidentally or on purpose) trained on the test set, many tests are just bad. MMLU likely maxes out at 90% or so because so many of the questions in it are just wrong. It is also uncalibrated in difficulty.

Ethan Mollick (@emollick) 's Twitter Profile Photo

I have taught an entrepreneurship class for 15 years. I just had an online Q&A with a group of entrepreneurs taking a similar class. As an experiment, I put their questions into o3. The answers were all very good. (The examples here are not from the students, but are typical)

I have taught an entrepreneurship class for 15 years. I just had an online Q&A with a group of entrepreneurs taking a similar class. As an experiment, I put their questions into o3.

The answers were all very good. (The examples here are not from the students, but are typical)
Ethan Mollick (@emollick) 's Twitter Profile Photo

As happens to every AI term once it becomes used in marketing (see also "agents" and "vibe coding"), I regret to inform you that no one agrees on a common definition of "digital twin," let alone what a digital twin actually does.

Ethan Mollick (@emollick) 's Twitter Profile Photo

A weird thing about LLMs is that they just happen to do many things but almost all uses are undocumented. For example, GPT-4o is very good at helping farmers identify swine diseases. There is a lot of value in experts exploring & benchmarking how good LLMs are at various tasks.

A weird thing about LLMs is that they just happen to do many things but almost all uses are undocumented.

For example, GPT-4o is very good at helping farmers identify swine diseases.

There is a lot of value in experts exploring & benchmarking how good LLMs are at various tasks.