Thomas Savage (@thomasrsavage) 's Twitter Profile
Thomas Savage

@thomasrsavage

Internal Medicine Physician, clinical educator, AI optimist

ID: 1756415634492321792

calendar_today10-02-2024 20:31:51

14 Tweet

13 Takipçi

41 Takip Edilen

npj Digital Medicine (@npjdigitalmed) 's Twitter Profile Photo

Mitigating the 'black box' of AI: LLMs can imitate diagnostic reasoning strategies when solving clinical cases & provide an interpretable means to assess if the generated answer is true/false based on the diagnostic reasoning's factual & logical accuracy. nature.com/articles/s4174…

Mitigating the 'black box' of AI: LLMs can imitate diagnostic reasoning strategies when solving clinical cases & provide an interpretable means to assess if the generated answer is true/false based on the diagnostic reasoning's factual & logical accuracy.

nature.com/articles/s4174…
Omar Khattab (@lateinteraction) 's Twitter Profile Photo

Long context will eventually work, then will eventually become less expensive and scale better. For now, though, the tradeoffs may not be great. (Must note this plot says 3000, not 10M.)

Jim Fan (@drjimfan) 's Twitter Profile Photo

We live in such strange times. Apple, a company famous for its secrecy, published a paper with staggering amount of details on their multimodal foundation model. Those who are supposed to be open are now wayyy less than Apple. MM1 is a treasure trove of analysis. They discuss

We live in such strange times. Apple, a company famous for its secrecy, published a paper with staggering amount of details on their multimodal foundation model. Those who are supposed to be open are now wayyy less than Apple.

MM1 is a treasure trove of analysis. They discuss
JAMA Internal Medicine (@jamainternalmed) 's Twitter Profile Photo

In this quasi-experimental study, a deterioration model intervention was found to be associated with a decreased risk of escalations in care during hospitalization. ja.ma/4aqjG7V

In this quasi-experimental study, a deterioration model intervention was found to be associated with a decreased risk of escalations in care during hospitalization. ja.ma/4aqjG7V
Anil Makam (@anilmakam) 's Twitter Profile Photo

Fascinating regression discontinuity study in JAMA Internal Medicine of Epic's deterioration index (EDI) by Rob Gallo, MD EDI alerts reduced rapid response & ICU transfers Though sensitive to bandwidth choice below & above the EDI threshold when the alert fires jamanetwork.com/journals/jamai…

Jonathan H Chen MD PhD (@jonc101x) 's Twitter Profile Photo

Increase your sample size by asking LLM chatbot the same question 100 times. Wait, maybe not? Rob Gallo, MD diving in on evaluating generative #AI. Repeated prompting like asking the same person a question or like random sampling from a population of people?jamanetwork.com/journals/jama/…

Thomas Savage (@thomasrsavage) 's Twitter Profile Photo

LLMs seem overconfident when responding to medical questions, so how do we know when they are actually uncertain? In our preprint we review strategies to estimate LLM uncertainty for medical diagnosis and treatment selection. medrxiv.org/content/10.110…

Thomas Savage (@thomasrsavage) 's Twitter Profile Photo

LLM fine tuning is surprisingly underused in medicine. With data siloed, we will need fine tuning to learn knowledge and preferences that are unique to our health systems . Here we show the benefits of SFT and DPO for many common medical nlp tasks (link: arxiv.org/pdf/2409.12741)

Andrew Ng (@andrewyng) 's Twitter Profile Photo

A decision on SB-1047 is due soon. Governor Gavin Newsom has said he's concerned about its "chilling effect, particularly in the open source community". He's right, and I hope he will veto this. If you agree, please like/retweet this to show your support for VETOing SB-1047!

Jonathan H Chen MD PhD (@jonc101x) 's Twitter Profile Photo

Large language model chatbot #AI systems are remarkably accurate on medical questions, but hard to use in high-stakes medicine when you're unsure how confident the answer is (chatbots have tendency to express high confidence, regardless of factuality). academic.oup.com/jamia/article-…

Penn LDI (@pennldi) 's Twitter Profile Photo

LDI Fellow Thomas Savage's study shows that large language models can estimate their uncertainty in medical diagnosis using sample consistency (SC) proxies, which proved most reliable for uncertainty detection. Learn more here. CC: Penn Medicine academic.oup.com/jamia/advance-…

npj Digital Medicine (@npjdigitalmed) 's Twitter Profile Photo

2🥈 Diagnostic reasoning prompts reveal the potential for large language model interpretability in medicine nature.com/articles/s4174…

2🥈 Diagnostic reasoning prompts reveal the potential for large language model interpretability in medicine nature.com/articles/s4174…
npj Digital Medicine (@npjdigitalmed) 's Twitter Profile Photo

Listen to Thomas Savage at Penn, in our first 2-minute Author Spotlight, discuss his work which was one of the top 10 cited papers at @npjdigitalmed in 2024!