Highlights • There may be an added value of AI model supported imaging-based COVID-19 detection. • Studies reported comparable or better performance of AI or AI-supported readings. • There was lower variability of diagnostic performance for AI than for human readers. • Our systematic review shows heterogeneity of data characteristics and risks of bias. • There is a variety of applied methodologies and statistical analysis limitations. Purpose A growing number of studies have examined whether Artificial Intelligence (AI) systems can support imaging-based diagnosis of COVID-19-caused pneumonia, including both gains in diagnostic performance and speed. However, what is currently missing is a combined appreciation of studies comparing human readers and AI. Methods We followed PRISMA-DTA guidelines for our systematic review, searching EMBASE, PUBMED and Scopus databases. To gain insights into the potential value of AI methods, we focused on studies comparing the performance of human readers versus AI models or versus AI-supported human readings. Results Our search identified 1270 studies, of which 12 fulfilled specific selection criteria. Concerning diagnostic performance, in testing datasets reported sensitivity was 42–100% (human readers, n = 9 studies), 60–95% (AI systems, n = 10) and 81–98% (AI-supported readers, n = 3), whilst reported specificity was 26–100% (human readers, n = 8), 61–96% (AI systems, n = 10) and 78–99% (AI-supported readings, n = 2). One study highlighted the potential of AI-supported readings for the assessment of lung lesion burden changes, whilst two studies indicated potential time savings for detection with AI. Conclusions Our review indicates that AI systems or AI-supported human readings show less performance variability (interquartile range) in general, and may support the differentiation of COVID-19 pneumonia from other forms of pneumonia when used in high-prevalence and symptomatic populations. However, inconsistencies related to study design, reporting of data, areas of risk of bias, as well as limitations of statistical analyses complicate clear conclusions. We therefore support efforts for developing critical elements of study design when assessing the value of AI for diagnostic imaging.
【초록키워드】 COVID-19, COVID-19 pneumonia, Pneumonia, artificial intelligence, diagnostic, Diagnosis, lung, systematic review, heterogeneity, sensitivity, specificity, Characteristics, artificial, symptomatic, Diagnostic imaging, Risk of bias, Study design, Reporting, dataset, methodology, differentiation, speed, Critical, statistical analysis, criteria, lung lesion, Support, changes, interquartile range, Variability, effort, statistical analyses, Highlights, limitation, element, limitations, populations, intelligence, Result, examined, reported, indicated, form, applied, supported, indicate, added, less, variety, comparable, complicate, risks of bias, statistical analysis, 【제목키워드】 COVID-19, added,