Phenotyping severity of patient-centered outcomes using clinical notes: A prostate cancer use case

Selen Bozkurt; Rohan Paul; Jean Coquet; Ran Sun; Imon Banerjee; James D. Brooks; Tina Hernandez-Boussard

doi:10.1002/lrh2.10237

Phenotyping severity of patient-centered outcomes using clinical notes: A prostate cancer use case

Selen Bozkurt, Rohan Paul, Jean Coquet, Ran Sun, Imon Banerjee, James D. Brooks, Tina Hernandez-Boussard

Diagnostic Radiology

Research output: Contribution to journal › Article › peer-review

Abstract

Introduction: A learning health system (LHS) must improve care in ways that are meaningful to patients, integrating patient-centered outcomes (PCOs) into core infrastructure. PCOs are common following cancer treatment, such as urinary incontinence (UI) following prostatectomy. However, PCOs are not systematically recorded because they can only be described by the patient, are subjective and captured as unstructured text in the electronic health record (EHR). Therefore, PCOs pose significant challenges for phenotyping patients. Here, we present a natural language processing (NLP) approach for phenotyping patients with UI to classify their disease into severity subtypes, which can increase opportunities to provide precision-based therapy and promote a value-based delivery system. Methods: Patients undergoing prostate cancer treatment from 2008 to 2018 were identified at an academic medical center. Using a hybrid NLP pipeline that combines rule-based and deep learning methodologies, we classified positive UI cases as mild, moderate, and severe by mining clinical notes. Results: The rule-based model accurately classified UI into disease severity categories (accuracy: 0.86), which outperformed the deep learning model (accuracy: 0.73). In the deep learning model, the recall rates for mild and moderate group were higher than the precision rate (0.78 and 0.79, respectively). A hybrid model that combined both methods did not improve the accuracy of the rule-based model but did outperform the deep learning model (accuracy: 0.75). Conclusion: Phenotyping patients based on indication and severity of PCOs is essential to advance a patient centered LHS. EHRs contain valuable information on PCOs and by using NLP methods, it is feasible to accurately and efficiently phenotype PCO severity. Phenotyping must extend beyond the identification of disease to provide classification of disease severity that can be used to guide treatment and inform shared decision-making. Our methods demonstrate a path to a patient centered LHS that could advance precision medicine.

Original language	English (US)
Article number	e10237
Journal	Learning Health Systems
Volume	4
Issue number	4
DOIs	https://doi.org/10.1002/lrh2.10237
State	Published - Oct 1 2020

Keywords

deep phenotyping
natural language processing
prostate cancer
urinary incontinence

ASJC Scopus subject areas

Health Informatics
Public Health, Environmental and Occupational Health
Health Information Management

Access to Document

10.1002/lrh2.10237

Cite this

@article{e13db7ede17f4698ae963d697b1e149d,

title = "Phenotyping severity of patient-centered outcomes using clinical notes: A prostate cancer use case",

abstract = "Introduction: A learning health system (LHS) must improve care in ways that are meaningful to patients, integrating patient-centered outcomes (PCOs) into core infrastructure. PCOs are common following cancer treatment, such as urinary incontinence (UI) following prostatectomy. However, PCOs are not systematically recorded because they can only be described by the patient, are subjective and captured as unstructured text in the electronic health record (EHR). Therefore, PCOs pose significant challenges for phenotyping patients. Here, we present a natural language processing (NLP) approach for phenotyping patients with UI to classify their disease into severity subtypes, which can increase opportunities to provide precision-based therapy and promote a value-based delivery system. Methods: Patients undergoing prostate cancer treatment from 2008 to 2018 were identified at an academic medical center. Using a hybrid NLP pipeline that combines rule-based and deep learning methodologies, we classified positive UI cases as mild, moderate, and severe by mining clinical notes. Results: The rule-based model accurately classified UI into disease severity categories (accuracy: 0.86), which outperformed the deep learning model (accuracy: 0.73). In the deep learning model, the recall rates for mild and moderate group were higher than the precision rate (0.78 and 0.79, respectively). A hybrid model that combined both methods did not improve the accuracy of the rule-based model but did outperform the deep learning model (accuracy: 0.75). Conclusion: Phenotyping patients based on indication and severity of PCOs is essential to advance a patient centered LHS. EHRs contain valuable information on PCOs and by using NLP methods, it is feasible to accurately and efficiently phenotype PCO severity. Phenotyping must extend beyond the identification of disease to provide classification of disease severity that can be used to guide treatment and inform shared decision-making. Our methods demonstrate a path to a patient centered LHS that could advance precision medicine.",

keywords = "deep phenotyping, natural language processing, prostate cancer, urinary incontinence",

author = "Selen Bozkurt and Rohan Paul and Jean Coquet and Ran Sun and Imon Banerjee and Brooks, {James D.} and Tina Hernandez-Boussard",

note = "Publisher Copyright: {\textcopyright} 2020 The Authors. Learning Health Systems published by Wiley Periodicals LLC on behalf of the University of Michigan.",

year = "2020",

month = oct,

day = "1",

doi = "10.1002/lrh2.10237",

language = "English (US)",

volume = "4",

journal = "Learning Health Systems",

issn = "2379-6146",

publisher = "John Wiley and Sons Inc.",

number = "4",

}

TY - JOUR

T1 - Phenotyping severity of patient-centered outcomes using clinical notes

T2 - A prostate cancer use case

AU - Bozkurt, Selen

AU - Paul, Rohan

AU - Coquet, Jean

AU - Sun, Ran

AU - Banerjee, Imon

AU - Brooks, James D.

AU - Hernandez-Boussard, Tina

PY - 2020/10/1

Y1 - 2020/10/1

N2 - Introduction: A learning health system (LHS) must improve care in ways that are meaningful to patients, integrating patient-centered outcomes (PCOs) into core infrastructure. PCOs are common following cancer treatment, such as urinary incontinence (UI) following prostatectomy. However, PCOs are not systematically recorded because they can only be described by the patient, are subjective and captured as unstructured text in the electronic health record (EHR). Therefore, PCOs pose significant challenges for phenotyping patients. Here, we present a natural language processing (NLP) approach for phenotyping patients with UI to classify their disease into severity subtypes, which can increase opportunities to provide precision-based therapy and promote a value-based delivery system. Methods: Patients undergoing prostate cancer treatment from 2008 to 2018 were identified at an academic medical center. Using a hybrid NLP pipeline that combines rule-based and deep learning methodologies, we classified positive UI cases as mild, moderate, and severe by mining clinical notes. Results: The rule-based model accurately classified UI into disease severity categories (accuracy: 0.86), which outperformed the deep learning model (accuracy: 0.73). In the deep learning model, the recall rates for mild and moderate group were higher than the precision rate (0.78 and 0.79, respectively). A hybrid model that combined both methods did not improve the accuracy of the rule-based model but did outperform the deep learning model (accuracy: 0.75). Conclusion: Phenotyping patients based on indication and severity of PCOs is essential to advance a patient centered LHS. EHRs contain valuable information on PCOs and by using NLP methods, it is feasible to accurately and efficiently phenotype PCO severity. Phenotyping must extend beyond the identification of disease to provide classification of disease severity that can be used to guide treatment and inform shared decision-making. Our methods demonstrate a path to a patient centered LHS that could advance precision medicine.

AB - Introduction: A learning health system (LHS) must improve care in ways that are meaningful to patients, integrating patient-centered outcomes (PCOs) into core infrastructure. PCOs are common following cancer treatment, such as urinary incontinence (UI) following prostatectomy. However, PCOs are not systematically recorded because they can only be described by the patient, are subjective and captured as unstructured text in the electronic health record (EHR). Therefore, PCOs pose significant challenges for phenotyping patients. Here, we present a natural language processing (NLP) approach for phenotyping patients with UI to classify their disease into severity subtypes, which can increase opportunities to provide precision-based therapy and promote a value-based delivery system. Methods: Patients undergoing prostate cancer treatment from 2008 to 2018 were identified at an academic medical center. Using a hybrid NLP pipeline that combines rule-based and deep learning methodologies, we classified positive UI cases as mild, moderate, and severe by mining clinical notes. Results: The rule-based model accurately classified UI into disease severity categories (accuracy: 0.86), which outperformed the deep learning model (accuracy: 0.73). In the deep learning model, the recall rates for mild and moderate group were higher than the precision rate (0.78 and 0.79, respectively). A hybrid model that combined both methods did not improve the accuracy of the rule-based model but did outperform the deep learning model (accuracy: 0.75). Conclusion: Phenotyping patients based on indication and severity of PCOs is essential to advance a patient centered LHS. EHRs contain valuable information on PCOs and by using NLP methods, it is feasible to accurately and efficiently phenotype PCO severity. Phenotyping must extend beyond the identification of disease to provide classification of disease severity that can be used to guide treatment and inform shared decision-making. Our methods demonstrate a path to a patient centered LHS that could advance precision medicine.

KW - deep phenotyping

KW - natural language processing

KW - prostate cancer

KW - urinary incontinence

UR - http://www.scopus.com/inward/record.url?scp=85088031672&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=85088031672&partnerID=8YFLogxK

U2 - 10.1002/lrh2.10237

DO - 10.1002/lrh2.10237

M3 - Article

AN - SCOPUS:85088031672

SN - 2379-6146

VL - 4

JO - Learning Health Systems

JF - Learning Health Systems

IS - 4

M1 - e10237

ER -

Phenotyping severity of patient-centered outcomes using clinical notes: A prostate cancer use case

Abstract

Keywords

ASJC Scopus subject areas

Access to Document

Other files and links

Fingerprint

Cite this