The way we currently report human performance systematically underestimates it, making AI look better than it is.
CT scanning is just awful for diagnosing Covid-19
Reports that CT scanning may be better than PCR testing for Covid-19 are flawed and almost certainly wrong.
Improving Medical AI Safety by Addressing Hidden Stratification
Medical AI testing is unsafe, but addressing hidden stratification may be a way to prevent harm without upending the current regulatory environment.
Medical AI Safety: We have a problem.
For the first time ever, AI systems can directly harm patients. Are we doing enough to prevent a medical AI tragedy, the equivalent of a thalidomide event?
The philosophical argument for using ROC curves
I just wanted to do a quick follow-up to my recent blog post, which discussed the performance metrics I think might be appropriate for use in medical AI studies. One thing I didn't cover was the reason we might want to use multiple metrics, or the philosophy behind choosing the ones I did. So today, …
Do machines actually beat doctors? ROC curves and performance metrics
Deep learning research in medicine is a bit like the Wild West at the moment; sometimes you find gold, sometimes a giant steampunk spider-bot causes a ruckus. This has derailed my series on whether AI will be replacing doctors soon, as I have felt the need to focus a bit more on how to assess …