Benjamin Hou
@farrell192
ID: 880907581732081666
30-06-2017 21:55:08
54 Tweets
128 Followers
150 Following
🤔 LLMs such as ChatGPT score high on multiple-choice medical questions, but do they have a solid rationale? 🤯 Our evaluations revealed hidden flaws in their decision-making. 🥳 Excited to share our latest study, published with Nature Portfolio in npj Digital Medicine. 🧵(1/N)