Fair patient model: Mitigating bias in the patient representation learned from the electronic health records

Sonish Sivarajkumar; Yufei Huang; Yanshan Wang

doi:10.1016/j.jbi.2023.104544

Fair patient model: Mitigating bias in the patient representation learned from the electronic health records

Sonish Sivarajkumar, Yufei Huang, Yanshan Wang

Digital Health Sciences

Research output: Contribution to journal › Article › peer-review

Abstract

Objective: To pre-train fair and unbiased patient representations from Electronic Health Records (EHRs) using a novel weighted loss function that reduces bias and improves fairness in deep representation learning models. Methods: We defined a new loss function, called weighted loss function, in the deep representation learning model to balance the importance of different groups of patients and features. We applied the proposed model, called Fair Patient Model (FPM), to a sample of 34,739 patients from the MIMIC-III dataset and learned patient representations for four clinical outcome prediction tasks. Results: FPM outperformed the baseline models in terms of three fairness metrics: demographic parity, equality of opportunity difference, and equalized odds ratio. FPM also achieved comparable predictive performance with the baselines, with an average accuracy of 0.7912. Feature analysis revealed that FPM captured more information from clinical features than the baselines. Conclusion: FPM is a novel method to pre-train fair and unbiased patient representations from the EHR data using a weighted loss function. The learned representations can be used for various downstream tasks in healthcare and can be extended to other domains where fairness is important.

Original language	English (US)
Article number	104544
Journal	Journal of Biomedical Informatics
Volume	148
DOIs	https://doi.org/10.1016/j.jbi.2023.104544
State	Published - Dec 2023

ASJC Scopus subject areas

Health Informatics
Computer Science Applications

Access to Document

10.1016/j.jbi.2023.104544

Cite this

@article{a95ee4ffd43749bb910737189887fc08,

title = "Fair patient model: Mitigating bias in the patient representation learned from the electronic health records",

abstract = "Objective: To pre-train fair and unbiased patient representations from Electronic Health Records (EHRs) using a novel weighted loss function that reduces bias and improves fairness in deep representation learning models. Methods: We defined a new loss function, called weighted loss function, in the deep representation learning model to balance the importance of different groups of patients and features. We applied the proposed model, called Fair Patient Model (FPM), to a sample of 34,739 patients from the MIMIC-III dataset and learned patient representations for four clinical outcome prediction tasks. Results: FPM outperformed the baseline models in terms of three fairness metrics: demographic parity, equality of opportunity difference, and equalized odds ratio. FPM also achieved comparable predictive performance with the baselines, with an average accuracy of 0.7912. Feature analysis revealed that FPM captured more information from clinical features than the baselines. Conclusion: FPM is a novel method to pre-train fair and unbiased patient representations from the EHR data using a weighted loss function. The learned representations can be used for various downstream tasks in healthcare and can be extended to other domains where fairness is important.",

author = "Sonish Sivarajkumar and Yufei Huang and Yanshan Wang",

note = "Publisher Copyright: {\textcopyright} 2023 Elsevier Inc.",

year = "2023",

month = dec,

doi = "10.1016/j.jbi.2023.104544",

language = "English (US)",

volume = "148",

journal = "Journal of Biomedical Informatics",

issn = "1532-0464",

publisher = "Academic Press Inc.",

}

TY - JOUR

T1 - Fair patient model

T2 - Mitigating bias in the patient representation learned from the electronic health records

AU - Sivarajkumar, Sonish

AU - Huang, Yufei

AU - Wang, Yanshan

PY - 2023/12

Y1 - 2023/12

N2 - Objective: To pre-train fair and unbiased patient representations from Electronic Health Records (EHRs) using a novel weighted loss function that reduces bias and improves fairness in deep representation learning models. Methods: We defined a new loss function, called weighted loss function, in the deep representation learning model to balance the importance of different groups of patients and features. We applied the proposed model, called Fair Patient Model (FPM), to a sample of 34,739 patients from the MIMIC-III dataset and learned patient representations for four clinical outcome prediction tasks. Results: FPM outperformed the baseline models in terms of three fairness metrics: demographic parity, equality of opportunity difference, and equalized odds ratio. FPM also achieved comparable predictive performance with the baselines, with an average accuracy of 0.7912. Feature analysis revealed that FPM captured more information from clinical features than the baselines. Conclusion: FPM is a novel method to pre-train fair and unbiased patient representations from the EHR data using a weighted loss function. The learned representations can be used for various downstream tasks in healthcare and can be extended to other domains where fairness is important.

AB - Objective: To pre-train fair and unbiased patient representations from Electronic Health Records (EHRs) using a novel weighted loss function that reduces bias and improves fairness in deep representation learning models. Methods: We defined a new loss function, called weighted loss function, in the deep representation learning model to balance the importance of different groups of patients and features. We applied the proposed model, called Fair Patient Model (FPM), to a sample of 34,739 patients from the MIMIC-III dataset and learned patient representations for four clinical outcome prediction tasks. Results: FPM outperformed the baseline models in terms of three fairness metrics: demographic parity, equality of opportunity difference, and equalized odds ratio. FPM also achieved comparable predictive performance with the baselines, with an average accuracy of 0.7912. Feature analysis revealed that FPM captured more information from clinical features than the baselines. Conclusion: FPM is a novel method to pre-train fair and unbiased patient representations from the EHR data using a weighted loss function. The learned representations can be used for various downstream tasks in healthcare and can be extended to other domains where fairness is important.

UR - http://www.scopus.com/inward/record.url?scp=85178389335&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=85178389335&partnerID=8YFLogxK

U2 - 10.1016/j.jbi.2023.104544

DO - 10.1016/j.jbi.2023.104544

M3 - Article

C2 - 37995843

AN - SCOPUS:85178389335

SN - 1532-0464

VL - 148

JO - Journal of Biomedical Informatics

JF - Journal of Biomedical Informatics

M1 - 104544

ER -

Fair patient model: Mitigating bias in the patient representation learned from the electronic health records

Abstract

ASJC Scopus subject areas

Access to Document

Other files and links

Cite this