TY - JOUR
T1 - Comparing the number and relevance of false activations between 2 artificial intelligence computer-aided detection systems
T2 - the NOISE study
AU - Spadaccini, Marco
AU - Hassan, Cesare
AU - Alfarone, Ludovico
AU - Da Rio, Leonardo
AU - Maselli, Roberta
AU - Carrara, Silvia
AU - Galtieri, Piera Alessia
AU - Pellegatta, Gaia
AU - Fugazza, Alessandro
AU - Koleth, Glenn
AU - Emmanuel, James
AU - Anderloni, Andrea
AU - Mori, Yuichi
AU - Wallace, Michael B.
AU - Sharma, Prateek
AU - Repici, Alessandro
N1 - Funding Information:
DISCLOSURE: The following authors disclosed financial relationships: C. Hassan, A. Repici: Consultant for Medtronic and Fuji. R. Maselli: Consultant for Fuji. S. Carrara, A. Anderloni: Consultant for Olympus Corp. Y. Mori: Consultant for Olympus Corp; ownership in Cybernet Corp. M. B. Wallace: Research funding from Medtronic; Research grants from Fujifilm, Boston Scientific, Olympus, Medtronic, Ninepoint Medical, and Cosmo/Aries Pharmaceuticals; Stock options in Virgo Inc; Minor food/beverage at meetings from Synergy Pharmaceuticals, Boston Scientific, and Cook Medical; Consultant on behalf of Mayo Clinic, paid to Mayo Clinic, for GI Supply (2018), Endokey, Endostart, Boston Scientific, Microtek, and Verily. All other authors disclosed no financial relationships.
Publisher Copyright:
© 2022
PY - 2022/5
Y1 - 2022/5
N2 - Background and Aims: Artificial intelligence has been shown to be effective in polyp detection, and multiple computer-aided detection (CADe) systems have been developed. False-positive (FP) activation emerged as a possible way to benchmark CADe performance in clinical practice. The aim of this study was to validate a previously developed classification of FPs comparing the performances of different brands of approved CADe systems. Methods: We compared 2 different consecutive video libraries (40 video per arm) collected at Humanitas Research Hospital with 2 different CADe system brands (CADe A and CADe B). For each video, the number of CADe false activations, cause, and time spent by the endoscopist to examine the area erroneously highlighted were reported. The FP activations were classified according to the previously developed classification of FPs (the NOISE classification) according to their cause and relevance. Results: In CADe A 1021 FP activations were registered across the 40 videos (25.5 ± 12.2 FPs per colonoscopy), whereas in CADe B 1028 were identified (25.7 ± 13.2 FPs per colonoscopy; P = .53). Among them, 22.9 ± 9.9 (89.8% in CADe A) and 22.1 ± 10.0 (86.0% in CADe B) were because of artifacts from the bowel wall. Conversely, 2.6 ± 1.9 (10.2% in CADe A) and 3.5 ± 2.1 (14% in CADe B) were caused by bowel content (P = .45). Within CADe A each false activation required .2 ± .9 seconds, with 1.6 ± 1.0 FPs (6.3%) requiring additional time for endoscopic assessment. Comparable results were reported within CADe B with .2 ± .8 seconds spent per false activation and 1.8 ± 1.2 FPs per colonoscopy requiring additional inspection. Conclusions: The use of a standardized nomenclature provided comparable results with either of the 2 recently approved CADe systems. (Clinical trial registration number: NCT04399590.)
AB - Background and Aims: Artificial intelligence has been shown to be effective in polyp detection, and multiple computer-aided detection (CADe) systems have been developed. False-positive (FP) activation emerged as a possible way to benchmark CADe performance in clinical practice. The aim of this study was to validate a previously developed classification of FPs comparing the performances of different brands of approved CADe systems. Methods: We compared 2 different consecutive video libraries (40 video per arm) collected at Humanitas Research Hospital with 2 different CADe system brands (CADe A and CADe B). For each video, the number of CADe false activations, cause, and time spent by the endoscopist to examine the area erroneously highlighted were reported. The FP activations were classified according to the previously developed classification of FPs (the NOISE classification) according to their cause and relevance. Results: In CADe A 1021 FP activations were registered across the 40 videos (25.5 ± 12.2 FPs per colonoscopy), whereas in CADe B 1028 were identified (25.7 ± 13.2 FPs per colonoscopy; P = .53). Among them, 22.9 ± 9.9 (89.8% in CADe A) and 22.1 ± 10.0 (86.0% in CADe B) were because of artifacts from the bowel wall. Conversely, 2.6 ± 1.9 (10.2% in CADe A) and 3.5 ± 2.1 (14% in CADe B) were caused by bowel content (P = .45). Within CADe A each false activation required .2 ± .9 seconds, with 1.6 ± 1.0 FPs (6.3%) requiring additional time for endoscopic assessment. Comparable results were reported within CADe B with .2 ± .8 seconds spent per false activation and 1.8 ± 1.2 FPs per colonoscopy requiring additional inspection. Conclusions: The use of a standardized nomenclature provided comparable results with either of the 2 recently approved CADe systems. (Clinical trial registration number: NCT04399590.)
UR - http://www.scopus.com/inward/record.url?scp=85126548739&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85126548739&partnerID=8YFLogxK
U2 - 10.1016/j.gie.2021.12.031
DO - 10.1016/j.gie.2021.12.031
M3 - Article
C2 - 34995639
AN - SCOPUS:85126548739
SN - 0016-5107
VL - 95
SP - 975-981.e1
JO - Gastrointestinal endoscopy
JF - Gastrointestinal endoscopy
IS - 5
ER -