Automated Detection of Periprosthetic Joint Infections and Data Elements Using Natural Language Processing

Sunyang Fu; Cody C. Wyles; Douglas R. Osmon; Martha L. Carvour; Elham Sagheb; Taghi Ramazanian; Walter K. Kremers; David G. Lewallen; Daniel J. Berry; Sunghwan Sohn; Hilal Maradit Kremers

doi:10.1016/j.arth.2020.07.076

Automated Detection of Periprosthetic Joint Infections and Data Elements Using Natural Language Processing

Sunyang Fu, Cody C. Wyles, Douglas R. Osmon, Martha L. Carvour, Elham Sagheb, Taghi Ramazanian, Walter K. Kremers, David G. Lewallen, Daniel J. Berry, Sunghwan Sohn, Hilal Maradit Kremers

Quantitative Health Sciences

Research output: Contribution to journal › Article › peer-review

1 Scopus citations

Abstract

Background: Periprosthetic joint infection (PJI) data elements are contained in both structured and unstructured documents in electronic health records and require manual data collection. The goal of this study is to develop a natural language processing (NLP) algorithm to replicate manual chart review for PJI data elements. Methods: PJI was identified among all total joint arthroplasty (TJA) procedures performed at a single academic institution between 2000 and 2017. Data elements that comprise the Musculoskeletal Infection Society (MSIS) criteria were manually extracted and used as the gold standard for validation. A training sample of 1208 TJA surgeries (170 PJI cases) was randomly selected to develop the prototype NLP algorithms and an additional 1179 surgeries (150 PJI cases) were randomly selected as the test sample. The algorithms were applied to all consultation notes, operative notes, pathology reports, and microbiology reports to predict the correct status of PJI based on MSIS criteria. Results: The algorithm, which identified patients with PJI based on MSIS criteria, achieved an f1-score (harmonic mean of precision and recall) of 0.911. Algorithm performance in extracting the presence of sinus tract, purulence, pathologic documentation of inflammation, and growth of cultured organisms from the involved TJA achieved f1-scores that ranged from 0.771 to 0.982, sensitivity that ranged from 0.730 to 1.000, and specificity that ranged from 0.947 to 1.000. Conclusion: NLP-enabled algorithms have the potential to automate data collection for PJI diagnostic elements, which could directly improve patient care and augment cohort surveillance and research efforts. Further validation is needed in other hospital settings. Level of Evidence: Level III, Diagnostic.

Original language	English (US)
Pages (from-to)	688-692
Number of pages	5
Journal	Journal of Arthroplasty
Volume	36
Issue number	2
DOIs	https://doi.org/10.1016/j.arth.2020.07.076
State	Published - Feb 2021

Keywords

artificial intelligence
electronic health records
natural language processing
periprosthetic joint infection
total joint arthroplasty

ASJC Scopus subject areas

Orthopedics and Sports Medicine

Access to Document

10.1016/j.arth.2020.07.076

Cite this

@article{783a88b904a44ecd985ecb56eacc20f7,

title = "Automated Detection of Periprosthetic Joint Infections and Data Elements Using Natural Language Processing",

abstract = "Background: Periprosthetic joint infection (PJI) data elements are contained in both structured and unstructured documents in electronic health records and require manual data collection. The goal of this study is to develop a natural language processing (NLP) algorithm to replicate manual chart review for PJI data elements. Methods: PJI was identified among all total joint arthroplasty (TJA) procedures performed at a single academic institution between 2000 and 2017. Data elements that comprise the Musculoskeletal Infection Society (MSIS) criteria were manually extracted and used as the gold standard for validation. A training sample of 1208 TJA surgeries (170 PJI cases) was randomly selected to develop the prototype NLP algorithms and an additional 1179 surgeries (150 PJI cases) were randomly selected as the test sample. The algorithms were applied to all consultation notes, operative notes, pathology reports, and microbiology reports to predict the correct status of PJI based on MSIS criteria. Results: The algorithm, which identified patients with PJI based on MSIS criteria, achieved an f1-score (harmonic mean of precision and recall) of 0.911. Algorithm performance in extracting the presence of sinus tract, purulence, pathologic documentation of inflammation, and growth of cultured organisms from the involved TJA achieved f1-scores that ranged from 0.771 to 0.982, sensitivity that ranged from 0.730 to 1.000, and specificity that ranged from 0.947 to 1.000. Conclusion: NLP-enabled algorithms have the potential to automate data collection for PJI diagnostic elements, which could directly improve patient care and augment cohort surveillance and research efforts. Further validation is needed in other hospital settings. Level of Evidence: Level III, Diagnostic.",

keywords = "artificial intelligence, electronic health records, natural language processing, periprosthetic joint infection, total joint arthroplasty",

author = "Sunyang Fu and Wyles, {Cody C.} and Osmon, {Douglas R.} and Carvour, {Martha L.} and Elham Sagheb and Taghi Ramazanian and Kremers, {Walter K.} and Lewallen, {David G.} and Berry, {Daniel J.} and Sunghwan Sohn and Kremers, {Hilal Maradit}",

note = "Publisher Copyright: {\textcopyright} 2020 Elsevier Inc.",

year = "2021",

month = feb,

doi = "10.1016/j.arth.2020.07.076",

language = "English (US)",

volume = "36",

pages = "688--692",

journal = "Journal of Arthroplasty",

issn = "0883-5403",

publisher = "Churchill Livingstone",

number = "2",

}

TY - JOUR

T1 - Automated Detection of Periprosthetic Joint Infections and Data Elements Using Natural Language Processing

AU - Fu, Sunyang

AU - Wyles, Cody C.

AU - Osmon, Douglas R.

AU - Carvour, Martha L.

AU - Sagheb, Elham

AU - Ramazanian, Taghi

AU - Kremers, Walter K.

AU - Lewallen, David G.

AU - Berry, Daniel J.

AU - Sohn, Sunghwan

AU - Kremers, Hilal Maradit

PY - 2021/2

Y1 - 2021/2

N2 - Background: Periprosthetic joint infection (PJI) data elements are contained in both structured and unstructured documents in electronic health records and require manual data collection. The goal of this study is to develop a natural language processing (NLP) algorithm to replicate manual chart review for PJI data elements. Methods: PJI was identified among all total joint arthroplasty (TJA) procedures performed at a single academic institution between 2000 and 2017. Data elements that comprise the Musculoskeletal Infection Society (MSIS) criteria were manually extracted and used as the gold standard for validation. A training sample of 1208 TJA surgeries (170 PJI cases) was randomly selected to develop the prototype NLP algorithms and an additional 1179 surgeries (150 PJI cases) were randomly selected as the test sample. The algorithms were applied to all consultation notes, operative notes, pathology reports, and microbiology reports to predict the correct status of PJI based on MSIS criteria. Results: The algorithm, which identified patients with PJI based on MSIS criteria, achieved an f1-score (harmonic mean of precision and recall) of 0.911. Algorithm performance in extracting the presence of sinus tract, purulence, pathologic documentation of inflammation, and growth of cultured organisms from the involved TJA achieved f1-scores that ranged from 0.771 to 0.982, sensitivity that ranged from 0.730 to 1.000, and specificity that ranged from 0.947 to 1.000. Conclusion: NLP-enabled algorithms have the potential to automate data collection for PJI diagnostic elements, which could directly improve patient care and augment cohort surveillance and research efforts. Further validation is needed in other hospital settings. Level of Evidence: Level III, Diagnostic.

AB - Background: Periprosthetic joint infection (PJI) data elements are contained in both structured and unstructured documents in electronic health records and require manual data collection. The goal of this study is to develop a natural language processing (NLP) algorithm to replicate manual chart review for PJI data elements. Methods: PJI was identified among all total joint arthroplasty (TJA) procedures performed at a single academic institution between 2000 and 2017. Data elements that comprise the Musculoskeletal Infection Society (MSIS) criteria were manually extracted and used as the gold standard for validation. A training sample of 1208 TJA surgeries (170 PJI cases) was randomly selected to develop the prototype NLP algorithms and an additional 1179 surgeries (150 PJI cases) were randomly selected as the test sample. The algorithms were applied to all consultation notes, operative notes, pathology reports, and microbiology reports to predict the correct status of PJI based on MSIS criteria. Results: The algorithm, which identified patients with PJI based on MSIS criteria, achieved an f1-score (harmonic mean of precision and recall) of 0.911. Algorithm performance in extracting the presence of sinus tract, purulence, pathologic documentation of inflammation, and growth of cultured organisms from the involved TJA achieved f1-scores that ranged from 0.771 to 0.982, sensitivity that ranged from 0.730 to 1.000, and specificity that ranged from 0.947 to 1.000. Conclusion: NLP-enabled algorithms have the potential to automate data collection for PJI diagnostic elements, which could directly improve patient care and augment cohort surveillance and research efforts. Further validation is needed in other hospital settings. Level of Evidence: Level III, Diagnostic.

KW - artificial intelligence

KW - electronic health records

KW - natural language processing

KW - periprosthetic joint infection

KW - total joint arthroplasty

UR - http://www.scopus.com/inward/record.url?scp=85089827823&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=85089827823&partnerID=8YFLogxK

U2 - 10.1016/j.arth.2020.07.076

DO - 10.1016/j.arth.2020.07.076

M3 - Article

C2 - 32854996

AN - SCOPUS:85089827823

SN - 0883-5403

VL - 36

SP - 688

EP - 692

JO - Journal of Arthroplasty

JF - Journal of Arthroplasty

IS - 2

ER -

Automated Detection of Periprosthetic Joint Infections and Data Elements Using Natural Language Processing

Abstract

Keywords

ASJC Scopus subject areas

Access to Document

Other files and links

Fingerprint

Cite this