Identification of delirium from real-world electronic health record clinical notes

Jennifer St. Sauver; Sunyang Fu; Sunghwan Sohn; Susan Weston; Chun Fan; Janet Olson; Bjoerg Thorsteinsdottir; Nathan Lebrasseur; Sandeep Pagali; Walter Rocca; Hongfang Liu

doi:10.1017/cts.2023.610

Research output: Contribution to journal › Article › peer-review

Abstract

Introduction: We tested the ability of our natural language processing (NLP) algorithm to identify delirium episodes in a large-scale study using real-world clinical notes. Methods: We used the Rochester Epidemiology Project to identify persons ≥ 65 years who were hospitalized between 2011 and 2017. We identified all persons with an International Classification of Diseases code for delirium within ±14 days of a hospitalization. We independently applied our NLP algorithm to all clinical notes for this same population. We calculated rates using number of delirium episodes as the numerator and number of hospitalizations as the denominator. Rates were estimated overall, by demographic characteristics, and by year of episode, and differences were tested using Poisson regression. Results: In total, 14,255 persons had 37,554 hospitalizations between 2011 and 2017. The code-based delirium rate was 3.02 per 100 hospitalizations (95% CI: 2.85, 3.20). The NLP-based rate was 7.36 per 100 (95% CI: 7.09, 7.64). Rates increased with age (both p < 0.0001). Code-based rates were higher in men compared to women (p = 0.03), but NLP-based rates were similar by sex (p = 0.89). Code-based rates were similar by race and ethnicity, but NLP-based rates were higher in the White population compared to the Black and Asian populations (p = 0.001). Both types of rates increased significantly over time (both p values < 0.001). Conclusions: The NLP algorithm identified more delirium episodes compared to the ICD code method. However, NLP may still underestimate delirium cases because of limitations in real-world clinical notes, including incomplete documentation, practice changes over time, and missing clinical notes in some time periods.
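The rate definition in the Methods (delirium episodes divided by hospitalizations, reported per 100) can be illustrated with a minimal sketch. The function name, the log-scale normal approximation for the Poisson interval, and the episode count are all assumptions made for illustration: the abstract does not report episode counts or say how the confidence intervals were computed. The count of 1,134 is hypothetical, back-calculated from the reported code-based rate of 3.02 per 100 and the 37,554 hospitalizations.

```python
import math


def rate_per_100_with_ci(events, hospitalizations, z=1.96):
    """Return (rate, lower, upper), each expressed per 100 hospitalizations.

    The interval uses a normal approximation on the log scale for a
    Poisson count. This is an assumed method, not one stated in the abstract.
    """
    if events <= 0 or hospitalizations <= 0:
        raise ValueError("events and hospitalizations must be positive")
    rate = 100.0 * events / hospitalizations
    # For a Poisson count, the standard error of log(rate) is about 1/sqrt(events).
    half_width = z / math.sqrt(events)
    return rate, rate * math.exp(-half_width), rate * math.exp(half_width)


# Hypothetical episode count, back-calculated from the reported rate (not from the paper).
HYPOTHETICAL_EPISODES = 1134
# Number of hospitalizations as stated in the abstract.
HOSPITALIZATIONS = 37554

rate, lower, upper = rate_per_100_with_ci(HYPOTHETICAL_EPISODES, HOSPITALIZATIONS)
print(f"{rate:.2f} per 100 hospitalizations (95% CI: {lower:.2f}, {upper:.2f})")
```

Under these assumptions the result falls close to the reported code-based rate and interval, which suggests only that the published figures are consistent with a Poisson-type interval, not that this was the authors' exact procedure.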

Original language	English (US)
Article number	e187
Journal	Journal of Clinical and Translational Science
Volume	7
Issue number	1
DOIs	https://doi.org/10.1017/cts.2023.610
State	Published - Aug 24 2023

Keywords

Delirium
International Classification of Diseases (ICD)
bioinformatics
electronic health records
natural language processing algorithm

ASJC Scopus subject areas

General Medicine

Cite this

@article{4f46a61473f34076b5c902094cf94b68,

title = "Identification of delirium from real-world electronic health record clinical notes",

abstract = "Introduction: We tested the ability of our natural language processing (NLP) algorithm to identify delirium episodes in a large-scale study using real-world clinical notes. Methods: We used the Rochester Epidemiology Project to identify persons ≥ 65 years who were hospitalized between 2011 and 2017. We identified all persons with an International Classification of Diseases code for delirium within ±14 days of a hospitalization. We independently applied our NLP algorithm to all clinical notes for this same population. We calculated rates using number of delirium episodes as the numerator and number of hospitalizations as the denominator. Rates were estimated overall, by demographic characteristics, and by year of episode, and differences were tested using Poisson regression. Results: In total, 14,255 persons had 37,554 hospitalizations between 2011 and 2017. The code-based delirium rate was 3.02 per 100 hospitalizations (95% CI: 2.85, 3.20). The NLP-based rate was 7.36 per 100 (95% CI: 7.09, 7.64). Rates increased with age (both p < 0.0001). Code-based rates were higher in men compared to women (p = 0.03), but NLP-based rates were similar by sex (p = 0.89). Code-based rates were similar by race and ethnicity, but NLP-based rates were higher in the White population compared to the Black and Asian populations (p = 0.001). Both types of rates increased significantly over time (both p values < 0.001). Conclusions: The NLP algorithm identified more delirium episodes compared to the ICD code method. However, NLP may still underestimate delirium cases because of limitations in real-world clinical notes, including incomplete documentation, practice changes over time, and missing clinical notes in some time periods.",

keywords = "Delirium, International Classification of Diseases (ICD), bioinformatics, electronic health records, natural language processing algorithm",

author = "{St. Sauver}, Jennifer and Sunyang Fu and Sunghwan Sohn and Susan Weston and Chun Fan and Janet Olson and Bjoerg Thorsteinsdottir and Nathan Lebrasseur and Sandeep Pagali and Walter Rocca and Hongfang Liu",

note = "Publisher Copyright: {\textcopyright} Mayo Foundation for Medical Education and Research (Mayo Clinic), 2023. Published by Cambridge University Press on behalf of The Association for Clinical and Translational Science.",

year = "2023",

month = aug,

day = "24",

doi = "10.1017/cts.2023.610",

language = "English (US)",

volume = "7",

journal = "Journal of Clinical and Translational Science",

issn = "2059-8661",

publisher = "Cambridge University Press",

number = "1",

}

TY - JOUR

T1 - Identification of delirium from real-world electronic health record clinical notes

AU - St. Sauver, Jennifer

AU - Fu, Sunyang

AU - Sohn, Sunghwan

AU - Weston, Susan

AU - Fan, Chun

AU - Olson, Janet

AU - Thorsteinsdottir, Bjoerg

AU - Lebrasseur, Nathan

AU - Pagali, Sandeep

AU - Rocca, Walter

AU - Liu, Hongfang

N1 - Publisher Copyright: © Mayo Foundation for Medical Education and Research (Mayo Clinic), 2023. Published by Cambridge University Press on behalf of The Association for Clinical and Translational Science.

PY - 2023/8/24

Y1 - 2023/8/24

N2 - Introduction: We tested the ability of our natural language processing (NLP) algorithm to identify delirium episodes in a large-scale study using real-world clinical notes. Methods: We used the Rochester Epidemiology Project to identify persons ≥ 65 years who were hospitalized between 2011 and 2017. We identified all persons with an International Classification of Diseases code for delirium within ±14 days of a hospitalization. We independently applied our NLP algorithm to all clinical notes for this same population. We calculated rates using number of delirium episodes as the numerator and number of hospitalizations as the denominator. Rates were estimated overall, by demographic characteristics, and by year of episode, and differences were tested using Poisson regression. Results: In total, 14,255 persons had 37,554 hospitalizations between 2011 and 2017. The code-based delirium rate was 3.02 per 100 hospitalizations (95% CI: 2.85, 3.20). The NLP-based rate was 7.36 per 100 (95% CI: 7.09, 7.64). Rates increased with age (both p < 0.0001). Code-based rates were higher in men compared to women (p = 0.03), but NLP-based rates were similar by sex (p = 0.89). Code-based rates were similar by race and ethnicity, but NLP-based rates were higher in the White population compared to the Black and Asian populations (p = 0.001). Both types of rates increased significantly over time (both p values < 0.001). Conclusions: The NLP algorithm identified more delirium episodes compared to the ICD code method. However, NLP may still underestimate delirium cases because of limitations in real-world clinical notes, including incomplete documentation, practice changes over time, and missing clinical notes in some time periods.

AB - Introduction: We tested the ability of our natural language processing (NLP) algorithm to identify delirium episodes in a large-scale study using real-world clinical notes. Methods: We used the Rochester Epidemiology Project to identify persons ≥ 65 years who were hospitalized between 2011 and 2017. We identified all persons with an International Classification of Diseases code for delirium within ±14 days of a hospitalization. We independently applied our NLP algorithm to all clinical notes for this same population. We calculated rates using number of delirium episodes as the numerator and number of hospitalizations as the denominator. Rates were estimated overall, by demographic characteristics, and by year of episode, and differences were tested using Poisson regression. Results: In total, 14,255 persons had 37,554 hospitalizations between 2011 and 2017. The code-based delirium rate was 3.02 per 100 hospitalizations (95% CI: 2.85, 3.20). The NLP-based rate was 7.36 per 100 (95% CI: 7.09, 7.64). Rates increased with age (both p < 0.0001). Code-based rates were higher in men compared to women (p = 0.03), but NLP-based rates were similar by sex (p = 0.89). Code-based rates were similar by race and ethnicity, but NLP-based rates were higher in the White population compared to the Black and Asian populations (p = 0.001). Both types of rates increased significantly over time (both p values < 0.001). Conclusions: The NLP algorithm identified more delirium episodes compared to the ICD code method. However, NLP may still underestimate delirium cases because of limitations in real-world clinical notes, including incomplete documentation, practice changes over time, and missing clinical notes in some time periods.

KW - Delirium

KW - International Classification of Diseases (ICD)

KW - bioinformatics

KW - electronic health records

KW - natural language processing algorithm

UR - http://www.scopus.com/inward/record.url?scp=85169803353&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=85169803353&partnerID=8YFLogxK

U2 - 10.1017/cts.2023.610

DO - 10.1017/cts.2023.610

M3 - Article

AN - SCOPUS:85169803353

SN - 2059-8661

VL - 7

JO - Journal of Clinical and Translational Science

JF - Journal of Clinical and Translational Science

IS - 1

M1 - e187

ER -
