Docs are ROCs: a simple fix for a “methodologically indefensible” practice in medical AI studies.

December 8, 2020 ~ laurenoakdenrayner ~ 21 Comments

The way we currently report human performance systematically underestimates it, making AI look better than it is.

CT scanning is just awful for diagnosing Covid-19

March 23, 2020March 26, 2020 ~ laurenoakdenrayner ~ 23 Comments

Reports that CT scanning may be better than PCR testing for covid-19 are flawed and almost certainly wrong.

Improving Medical AI Safety by Addressing Hidden Stratification

October 14, 2019October 16, 2019 ~ laurenoakdenrayner ~ 29 Comments

Medical AI testing is unsafe, but addressing hidden stratification may be a way to prevent harm, without upending the current regulatory environment.

Medical AI Safety: We have a problem.

July 11, 2018July 14, 2018 ~ laurenoakdenrayner ~ 71 Comments

For the first time ever AI systems can directly harm patients. Are we doing enough to prevent a medical AI tragedy, the equivalent of a thalidomide event?

The philosophical argument for using ROC curves

January 7, 2018January 28, 2018 ~ laurenoakdenrayner ~ 22 Comments

I just wanted to do a quick follow up to my recent blog post, which discussed the performance metrics I think might be appropriate for use in medical AI studies. One thing I didn't cover was the reason we might want to use multiple metrics, or the philosophy behind choosing the ones I did. So today, … Continue reading The philosophical argument for using ROC curves

Do machines actually beat doctors? ROC curves and performance metrics

December 6, 2017January 5, 2018 ~ laurenoakdenrayner ~ 41 Comments

Deep learning research in medicine is a bit like the Wild West at the moment; sometimes you find gold, sometimes a giant steampunk spider-bot causes a ruckus. This has derailed my series on whether AI will be replacing doctors soon, as I have felt the need to focus a bit more on how to assess … Continue reading Do machines actually beat doctors? ROC curves and performance metrics