TY - JOUR
T1 - Getting More Out of Large Databases and EHRs with Natural Language Processing and Artificial Intelligence
T2 - The Future Is Here
AU - Khosravi, Bardia
AU - Rouzrokh, Pouria
AU - Erickson, Bradley J.
N1 - Funding Information:
Disclosure: This work was funded by a grant from the National Institute of Arthritis and Musculoskeletal and Skin Diseases (NIAMS) (P30AR76312) and the American Joint Replacement Research-Collaborative (AJRR-C) https://ajrrc.org/ . The content is solely the responsibility of the authors and does not necessarily represent the official views of the National Institutes of Health. The Disclosure of Potential Conflicts of Interest forms are provided with the online version of the article ( http://links.lww.com/JBJS/H134 ).
Publisher Copyright:
© 2022 Lippincott Williams and Wilkins. All rights reserved.
PY - 2022/10/19
Y1 - 2022/10/19
N2 - Electronic health records (EHRs) have created great opportunities to collect various information from clinical patient encounters. However, most EHR data are stored in unstructured form (e.g., clinical notes, surgical notes, and medication instructions), and researchers need data to be in computable form (structured) to extract meaningful relationships involving variables that can influence patient outcomes. Clinical natural language processing (NLP) is the field of extracting structured data from unstructured text documents in EHRs. Clinical text has several characteristics that mandate the use of special techniques to extract structured information from them compared with generic NLP methods. In this article, we define clinical NLP models, introduce different methods of information extraction from unstructured data using NLP, and describe the basic technical aspects of how deep learning-based NLP models work. We conclude by noting the challenges of working with clinical NLP models and summarizing the general steps needed to launch an NLP project.
AB - Electronic health records (EHRs) have created great opportunities to collect various information from clinical patient encounters. However, most EHR data are stored in unstructured form (e.g., clinical notes, surgical notes, and medication instructions), and researchers need data to be in computable form (structured) to extract meaningful relationships involving variables that can influence patient outcomes. Clinical natural language processing (NLP) is the field of extracting structured data from unstructured text documents in EHRs. Clinical text has several characteristics that mandate the use of special techniques to extract structured information from them compared with generic NLP methods. In this article, we define clinical NLP models, introduce different methods of information extraction from unstructured data using NLP, and describe the basic technical aspects of how deep learning-based NLP models work. We conclude by noting the challenges of working with clinical NLP models and summarizing the general steps needed to launch an NLP project.
UR - http://www.scopus.com/inward/record.url?scp=85140415094&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85140415094&partnerID=8YFLogxK
U2 - 10.2106/JBJS.22.00567
DO - 10.2106/JBJS.22.00567
M3 - Review article
C2 - 36260045
AN - SCOPUS:85140415094
SN - 0021-9355
VL - 104
SP - 51
EP - 55
JO - Journal of Bone and Joint Surgery
JF - Journal of Bone and Joint Surgery
ER -