Mark Zobeck (@markzobeck) 's Twitter Profile
Mark Zobeck

@markzobeck

MD, MPH. Peds Hem/Onc. Data nerd using technology to improve healthcare for kids around the world. Love languages: graphs, references, Bayesian statistics

ID: 951960446

linkhttps://www.thedoctorsdialectic.com/ calendar_today16-11-2012 16:13:34

1,1K Tweet

1,1K Followers

1,1K Following

Mark Zobeck (@markzobeck) 's Twitter Profile Photo

Prediction model recipe for disaster: Have 40 events. Screen 2000 SNPs, retain p<0.1. Step-wise slctn to throw SNPs+clinical vars into a model. Get AUC 0.87 (pic 1). Publish! The story often stops there. Thankfully, they validated, and it, unsurprisingly, fell apart (pic 2).

Prediction model recipe for disaster: 

Have 40 events. Screen 2000 SNPs, retain p&lt;0.1. Step-wise slctn to throw SNPs+clinical vars into a model. Get AUC 0.87 (pic 1). Publish!

The story often stops there.

Thankfully, they validated, and it, unsurprisingly, fell apart (pic 2).
Mark Zobeck (@markzobeck) 's Twitter Profile Photo

⚠️! "While many researchers may understand the AUC as an intrinsic measure of a clinical prediction model's (CPM) quality, in fact AUC is an extrinsic property of a CPM that emerges only when a model is applied to a specific population."

Mark Zobeck (@markzobeck) 's Twitter Profile Photo

Scale is all you need. Once the training dataset contains Aleph-0 observations, AGI will be able to multiply all the integers.

David Duvenaud (@davidduvenaud) 's Twitter Profile Photo

LLMs have complex joint beliefs about all sorts of quantities. And my postdoc James Requeima visualized them! In this thread we show LLM predictive distributions conditioned on data and free-form text. LLMs pick up on all kinds of subtle and unusual structure: đź§µ

Kareem Carr, Statistics Person (@kareem_carr) 's Twitter Profile Photo

Most people don't know this but AI is just statistics. It's a special kind of statistics where you can use data without permission, spend billions of dollars of resources on a single model, and you aren't personally responsible when it fails in idiotic ways and/or hurts people.

Elias Bareinboim (@eliasbareinboim) 's Twitter Profile Photo

I understand that CI, from the 1970s until around 2010, was mostly focused on the challenge of moving from OBS to EXP worlds and controlling for confounding in this sense. However, it's an oversimplification to think about CI as solely about observational studies, as the

I understand that CI, from the 1970s until around 2010, was mostly focused on the challenge of moving from OBS to EXP worlds and controlling for confounding in this sense. However, it's an oversimplification to think about CI as solely about observational studies, as the
Ming "Tommy" Tang (@tangming2005) 's Twitter Profile Photo

Common statistical tests are linear models (or: how to teach stats) lindeloev.github.io/tests-as-linea… I wish I were taught like this.

Common statistical tests are linear models (or: how to teach stats) lindeloev.github.io/tests-as-linea…
I wish I were taught like this.
Mark Zobeck (@markzobeck) 's Twitter Profile Photo

You know it's U.S. News and World Report Voting Season when random division chiefs reach out to be your best friend on Doximity.

Frank Harrell (@f2harrell) 's Twitter Profile Photo

New chapter in Regression Modeling Strategies #rms: ordinal regression generalizes Wilcoxon, log-rank, Kaplan-Meier, Cox PH model, and all common survival time analyses. Semiparametric regression is a unifying concept. hbiostat.org/rmsc/ordsurv #rstats #Statistics

Alec Helbling (@alec_helbling) 's Twitter Profile Photo

This is a fun interactive article explaining the intuition behind Gaussian Processes. Gaussian Processes allow you to construct a distribution over functions with a continuous domain. They are incredibly useful for applications like Bayesian regression.

Pavlos Msaouel (@pavlosmsaouel) 's Twitter Profile Photo

We are excited to host Miguel Hernán in Houston MD Anderson Cancer Center next week to give this year’s Melvin L. Samuels lecture (hybrid; zoom info in photo). Looking forward to the synergies that will emerge from his visit to advance rigorous causal inference methods across oncology.

We are excited to host <a href="/_MiguelHernan/">Miguel Hernán</a> in Houston <a href="/MDAndersonNews/">MD Anderson Cancer Center</a> next week to give this year’s Melvin L. Samuels lecture (hybrid; zoom info in photo). Looking forward to the synergies that will emerge from his visit to advance rigorous causal inference methods across oncology.
Frank Harrell (@f2harrell) 's Twitter Profile Photo

Very interesting paper on causal inference from statistical models; perhaps more actionable than most causal inference papers: tandfonline.com/doi/full/10.10… #Statistics