TY - JOUR
T1 - Decoding radiology reports
T2 - Potential application of OpenAI ChatGPT to enhance patient understanding of diagnostic reports
AU - Li, Hanzhou
AU - Moon, John T.
AU - Iyer, Deepak
AU - Balthazar, Patricia
AU - Krupinski, Elizabeth A.
AU - Bercu, Zachary L.
AU - Newsome, Janice M.
AU - Banerjee, Imon
AU - Gichoya, Judy W.
AU - Trivedi, Hari M.
N1 - Publisher Copyright:
© 2023 Elsevier Inc.
PY - 2023/9
Y1 - 2023/9
N2 - Purpose: To evaluate the complexity of diagnostic radiology reports across major imaging modalities and the ability of ChatGPT (Early March 2023 Version, OpenAI, California, USA) to simplify these reports to the 8th grade reading level of the average U.S. adult. Methods: We randomly sampled 100 radiographs (XR), 100 ultrasound (US), 100 CT, and 100 MRI radiology reports from our institution's database dated between 2022 and 2023 (N = 400). These were processed by ChatGPT using the prompt “Explain this radiology report to a patient in layman's terms in second person: <Report Text>”. Mean report length, Flesch reading ease score (FRES), and Flesch-Kincaid reading level (FKRL) were calculated for each report and ChatGPT output. T-tests were used to determine significance. Results: Mean report length was 164 ± 117 words, FRES was 38.0 ± 11.8, and FKRL was 10.4 ± 1.9. FKRL was significantly higher for CT and MRI than for US and XR. Only 60/400 (15%) had a FKRL <8.5. The mean simplified ChatGPT output length was 103 ± 36 words, FRES was 83.5 ± 5.6, and FKRL was 5.8 ± 1.1. This reflects a mean decrease of 61 words (p < 0.01), increase in FRES of 45.5 (p < 0.01), and decrease in FKRL of 4.6 (p < 0.01). All simplified outputs had FKRL <8.5. Discussion: Our study demonstrates the effective use of ChatGPT when tasked with simplifying radiology reports to below the 8th grade reading level. We report significant improvements in FRES, FKRL, and word count, the last of which requires modality-specific context.
KW - 21st century cures act
KW - Large language model
KW - Natural language processing
KW - Patient-centered reports
UR - http://www.scopus.com/inward/record.url?scp=85162156868&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85162156868&partnerID=8YFLogxK
U2 - 10.1016/j.clinimag.2023.06.008
DO - 10.1016/j.clinimag.2023.06.008
M3 - Article
C2 - 37336169
AN - SCOPUS:85162156868
SN - 0899-7071
VL - 101
SP - 137
EP - 141
JO - Clinical Imaging
JF - Clinical Imaging
ER -