TY - JOUR
T1 - Use of Natural Language Processing Algorithms to Identify Common Data Elements in Operative Notes for Knee Arthroplasty
AU - Sagheb, Elham
AU - Ramazanian, Taghi
AU - Tafti, Ahmad P.
AU - Fu, Sunyang
AU - Kremers, Walter K.
AU - Berry, Daniel J.
AU - Lewallen, David G.
AU - Sohn, Sunghwan
AU - Maradit Kremers, Hilal
N1 - Funding Information:
Funding: Supported by the National Institutes of Health (NIH) grants R01AR73147 and P30AR76312 .
Publisher Copyright:
© 2020 Elsevier Inc.
PY - 2021/3
Y1 - 2021/3
N2 - Background: Natural language processing (NLP) methods have the capability to process clinical free text in electronic health records, decreasing the need for costly manual chart review, and improving data quality. We developed rule-based NLP algorithms to automatically extract surgery specific data elements from knee arthroplasty operative notes. Methods: Within a cohort of 20,000 knee arthroplasty operative notes from 2000 to 2017 at a large tertiary institution, we randomly selected independent pairs of training and test sets to develop and evaluate NLP algorithms to detect five major data elements. The size of the training and test datasets were similar and ranged between 420 to 1592 surgeries. Expert rules using keywords in operative notes were used to implement NLP algorithms capturing: (1) category of surgery (total knee arthroplasty, unicompartmental knee arthroplasty, patellofemoral arthroplasty), (2) laterality of surgery, (3) constraint type, (4) presence of patellar resurfacing, and (5) implant model (catalog numbers). We used institutional registry data as our gold standard to evaluate the NLP algorithms. Results: NLP algorithms to detect the category of surgery, laterality, constraint, and patellar resurfacing achieved 98.3%, 99.5%, 99.2%, and 99.4% accuracy on test datasets, respectively. The implant model algorithm achieved an F1-score (harmonic mean of precision and recall) of 99.9%. Conclusions: NLP algorithms are a promising alternative to costly manual chart review to automate the extraction of embedded information within knee arthroplasty operative notes. Further validation in other hospital settings will enhance widespread implementation and efficiency in data capture for research and clinical purposes. Level of Evidence: Level III.
AB - Background: Natural language processing (NLP) methods have the capability to process clinical free text in electronic health records, decreasing the need for costly manual chart review, and improving data quality. We developed rule-based NLP algorithms to automatically extract surgery specific data elements from knee arthroplasty operative notes. Methods: Within a cohort of 20,000 knee arthroplasty operative notes from 2000 to 2017 at a large tertiary institution, we randomly selected independent pairs of training and test sets to develop and evaluate NLP algorithms to detect five major data elements. The size of the training and test datasets were similar and ranged between 420 to 1592 surgeries. Expert rules using keywords in operative notes were used to implement NLP algorithms capturing: (1) category of surgery (total knee arthroplasty, unicompartmental knee arthroplasty, patellofemoral arthroplasty), (2) laterality of surgery, (3) constraint type, (4) presence of patellar resurfacing, and (5) implant model (catalog numbers). We used institutional registry data as our gold standard to evaluate the NLP algorithms. Results: NLP algorithms to detect the category of surgery, laterality, constraint, and patellar resurfacing achieved 98.3%, 99.5%, 99.2%, and 99.4% accuracy on test datasets, respectively. The implant model algorithm achieved an F1-score (harmonic mean of precision and recall) of 99.9%. Conclusions: NLP algorithms are a promising alternative to costly manual chart review to automate the extraction of embedded information within knee arthroplasty operative notes. Further validation in other hospital settings will enhance widespread implementation and efficiency in data capture for research and clinical purposes. Level of Evidence: Level III.
KW - artificial intelligence
KW - constraint
KW - electronic health records
KW - natural language processing
KW - patella resurfacing
KW - total knee arthroplasty
UR - http://www.scopus.com/inward/record.url?scp=85092443985&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85092443985&partnerID=8YFLogxK
U2 - 10.1016/j.arth.2020.09.029
DO - 10.1016/j.arth.2020.09.029
M3 - Article
C2 - 33051119
AN - SCOPUS:85092443985
SN - 0883-5403
VL - 36
SP - 922
EP - 926
JO - Journal of Arthroplasty
JF - Journal of Arthroplasty
IS - 3
ER -