TY - JOUR
T1 - A DICOM Framework for Machine Learning and Processing Pipelines Against Real-time Radiology Images
AU - Kathiravelu, Pradeeban
AU - Sharma, Puneet
AU - Sharma, Ashish
AU - Banerjee, Imon
AU - Trivedi, Hari
AU - Purkayastha, Saptarshi
AU - Sinha, Priyanshu
AU - Cadrin-Chenevert, Alexandre
AU - Safdar, Nabile
AU - Gichoya, Judy Wawira
N1 - Funding Information:
This work was supported in part by The Cancer Imaging Archive (TCIA) Sustainment and Scalability Platforms for Quantitative Imaging Informatics in Precision Medicine [National Institute of Health (NIH) National Cancer Institute (NCI)] under Grant U24CA215109, and in part by the Methods and Tools for Integrating Pathomics Data into Cancer Registries (NIH NCI) under Grant UH3CA225021.
Publisher Copyright:
© 2021, The Author(s).
PY - 2021/8
Y1 - 2021/8
N2 - Real-time execution of machine learning (ML) pipelines on radiology images is difficult due to limited computing resources in clinical environments, whereas running them in research clusters requires efficient data transfer capabilities. We developed Niffler, an open-source Digital Imaging and Communications in Medicine (DICOM) framework that enables ML and processing pipelines in research clusters by efficiently retrieving images from the hospitals’ PACS and extracting the metadata from the images. We deployed Niffler at our institution (Emory Healthcare, the largest healthcare network in the state of Georgia) and retrieved data from 715 scanners spanning 12 sites, up to 350 GB/day continuously in real-time as a DICOM data stream over the past 2 years. We also used Niffler to retrieve images bulk on-demand based on user-provided filters to facilitate several research projects. This paper presents the architecture and three such use cases of Niffler. First, we executed an IVC filter detection and segmentation pipeline on abdominal radiographs in real-time, which was able to classify 989 test images with an accuracy of 96.0%. Second, we applied the Niffler Metadata Extractor to understand the operational efficiency of individual MRI systems based on calculated metrics. We benchmarked the accuracy of the calculated exam time windows by comparing Niffler against the Clinical Data Warehouse (CDW). Niffler accurately identified the scanners’ examination timeframes and idling times, whereas CDW falsely depicted several exam overlaps due to human errors. Third, with metadata extracted from the images by Niffler, we identified scanners with misconfigured time and reconfigured five scanners. Our evaluations highlight how Niffler enables real-time ML and processing pipelines in a research cluster.
AB - Real-time execution of machine learning (ML) pipelines on radiology images is difficult due to limited computing resources in clinical environments, whereas running them in research clusters requires efficient data transfer capabilities. We developed Niffler, an open-source Digital Imaging and Communications in Medicine (DICOM) framework that enables ML and processing pipelines in research clusters by efficiently retrieving images from the hospitals’ PACS and extracting the metadata from the images. We deployed Niffler at our institution (Emory Healthcare, the largest healthcare network in the state of Georgia) and retrieved data from 715 scanners spanning 12 sites, up to 350 GB/day continuously in real-time as a DICOM data stream over the past 2 years. We also used Niffler to retrieve images bulk on-demand based on user-provided filters to facilitate several research projects. This paper presents the architecture and three such use cases of Niffler. First, we executed an IVC filter detection and segmentation pipeline on abdominal radiographs in real-time, which was able to classify 989 test images with an accuracy of 96.0%. Second, we applied the Niffler Metadata Extractor to understand the operational efficiency of individual MRI systems based on calculated metrics. We benchmarked the accuracy of the calculated exam time windows by comparing Niffler against the Clinical Data Warehouse (CDW). Niffler accurately identified the scanners’ examination timeframes and idling times, whereas CDW falsely depicted several exam overlaps due to human errors. Third, with metadata extracted from the images by Niffler, we identified scanners with misconfigured time and reconfigured five scanners. Our evaluations highlight how Niffler enables real-time ML and processing pipelines in a research cluster.
KW - Clinical data warehouse (CDW)
KW - Digital Imaging and Communications in Medicine (DICOM)
KW - Machine learning (ML)
KW - Picture archiving and communication system (PACS)
UR - http://www.scopus.com/inward/record.url?scp=85112787014&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85112787014&partnerID=8YFLogxK
U2 - 10.1007/s10278-021-00491-w
DO - 10.1007/s10278-021-00491-w
M3 - Article
C2 - 34405297
AN - SCOPUS:85112787014
SN - 0897-1889
VL - 34
SP - 1005
EP - 1013
JO - Journal of Digital Imaging
JF - Journal of Digital Imaging
IS - 4
ER -