Artificial intelligence to detect papilledema from ocular fundus photographs

Dan Milea; Raymond P. Najjar; Jiang Zhubo; Daniel Ting; Caroline Vasseneix; Xinxing Xu; Masoud Aghsaei Fard; Pedro Fonseca; Kavin Vanikieti; Wolf A. Lagrèze; Chiara La Morgia; Carol Y. Cheung; Steffen Hamann; Christophe Chiquet; Nicolae Sanda; Hui Yang; Luis J. Mejico; Marie Bénédicte Rougier; Richard Kho; Tran Thi Ha Chau; Shweta Singhal; Philippe Gohier; Catherine Clermont-Vignal; Ching Yu Cheng; Jost B. Jonas; Patrick Yu-Wai-Man; Clare L. Fraser; John J. Chen; Selvakumar Ambika; Neil R. Miller; Yong Liu; Nancy J. Newman; Tien Y. Wong; Valérie Biousse

doi:10.1056/NEJMoa1917130

Artificial intelligence to detect papilledema from ocular fundus photographs

Dan Milea, Raymond P. Najjar, Jiang Zhubo, Daniel Ting, Caroline Vasseneix, Xinxing Xu, Masoud Aghsaei Fard, Pedro Fonseca, Kavin Vanikieti, Wolf A. Lagrèze, Chiara La Morgia, Carol Y. Cheung, Steffen Hamann, Christophe Chiquet, Nicolae Sanda, Hui Yang, Luis J. Mejico, Marie Bénédicte Rougier, Richard Kho, Tran Thi Ha ChauShweta Singhal, Philippe Gohier, Catherine Clermont-Vignal, Ching Yu Cheng, Jost B. Jonas, Patrick Yu-Wai-Man, Clare L. Fraser, John J. Chen, Selvakumar Ambika, Neil R. Miller, Yong Liu, Nancy J. Newman, Tien Y. Wong, Valérie Biousse

Ophthalmology

Research output: Contribution to journal › Article › peer-review

39 Scopus citations

Abstract

BACKGROUND Nonophthalmologist physicians do not confidently perform direct ophthalmoscopy. The use of artificial intelligence to detect papilledema and other optic-disk abnormalities from fundus photographs has not been well studied. METHODS We trained, validated, and externally tested a deep-learning system to classify optic disks as being normal or having papilledema or other abnormalities from 15,846 retrospectively collected ocular fundus photographs that had been obtained with pharmacologic pupillary dilation and various digital cameras in persons from multiple ethnic populations. Of these photographs, 14,341 from 19 sites in 11 countries were used for training and validation, and 1505 photographs from 5 other sites were used for external testing. Performance at classifying the optic-disk appearance was evaluated by calculating the area under the receiver-operating-characteristic curve (AUC), sensitivity, and specificity, as compared with a reference standard of clinical diagnoses by neuro-ophthalmologists. RESULTS The training and validation data sets from 6779 patients included 14,341 photographs: 9156 of normal disks, 2148 of disks with papilledema, and 3037 of disks with other abnormalities. The percentage classified as being normal ranged across sites from 9.8 to 100%; the percentage classified as having papilledema ranged across sites from zero to 59.5%. In the validation set, the system discriminated disks with papilledema from normal disks and disks with nonpapilledema abnormalities with an AUC of 0.99 (95% confidence interval [CI], 0.98 to 0.99) and normal from abnormal disks with an AUC of 0.99 (95% CI, 0.99 to 0.99). In the external-testing data set of 1505 photographs, the system had an AUC for the detection of papilledema of 0.96 (95% CI, 0.95 to 0.97), a sensitivity of 96.4% (95% CI, 93.9 to 98.3), and a specificity of 84.7% (95% CI, 82.3 to 87.1). CONCLUSIONS A deep-learning system using fundus photographs with pharmacologically dilated pupils differentiated among optic disks with papilledema, normal disks, and disks with nonpapilledema abnormalities.

Original language	English (US)
Pages (from-to)	1687-1695
Number of pages	9
Journal	New England Journal of Medicine
Volume	382
Issue number	18
DOIs	https://doi.org/10.1056/NEJMoa1917130
State	Published - Apr 30 2020

ASJC Scopus subject areas

General Medicine

Access to Document

10.1056/NEJMoa1917130

Cite this

Milea, D., Najjar, R. P., Zhubo, J., Ting, D., Vasseneix, C., Xu, X., Fard, M. A., Fonseca, P., Vanikieti, K., Lagrèze, W. A., La Morgia, C., Cheung, C. Y., Hamann, S., Chiquet, C., Sanda, N., Yang, H., Mejico, L. J., Rougier, M. B., Kho, R., ... Biousse, V. (2020). Artificial intelligence to detect papilledema from ocular fundus photographs. New England Journal of Medicine, 382(18), 1687-1695. https://doi.org/10.1056/NEJMoa1917130

Milea, D, Najjar, RP, Zhubo, J, Ting, D, Vasseneix, C, Xu, X, Fard, MA, Fonseca, P, Vanikieti, K, Lagrèze, WA, La Morgia, C, Cheung, CY, Hamann, S, Chiquet, C, Sanda, N, Yang, H, Mejico, LJ, Rougier, MB, Kho, R, Chau, TTH, Singhal, S, Gohier, P, Clermont-Vignal, C, Cheng, CY, Jonas, JB, Yu-Wai-Man, P, Fraser, CL, Chen, JJ, Ambika, S, Miller, NR, Liu, Y, Newman, NJ, Wong, TY & Biousse, V 2020, 'Artificial intelligence to detect papilledema from ocular fundus photographs', New England Journal of Medicine, vol. 382, no. 18, pp. 1687-1695. https://doi.org/10.1056/NEJMoa1917130

@article{ff7f1d2282704d2eb0810eb621198c9d,

title = "Artificial intelligence to detect papilledema from ocular fundus photographs",

abstract = "BACKGROUND Nonophthalmologist physicians do not confidently perform direct ophthalmoscopy. The use of artificial intelligence to detect papilledema and other optic-disk abnormalities from fundus photographs has not been well studied. METHODS We trained, validated, and externally tested a deep-learning system to classify optic disks as being normal or having papilledema or other abnormalities from 15,846 retrospectively collected ocular fundus photographs that had been obtained with pharmacologic pupillary dilation and various digital cameras in persons from multiple ethnic populations. Of these photographs, 14,341 from 19 sites in 11 countries were used for training and validation, and 1505 photographs from 5 other sites were used for external testing. Performance at classifying the optic-disk appearance was evaluated by calculating the area under the receiver-operating-characteristic curve (AUC), sensitivity, and specificity, as compared with a reference standard of clinical diagnoses by neuro-ophthalmologists. RESULTS The training and validation data sets from 6779 patients included 14,341 photographs: 9156 of normal disks, 2148 of disks with papilledema, and 3037 of disks with other abnormalities. The percentage classified as being normal ranged across sites from 9.8 to 100%; the percentage classified as having papilledema ranged across sites from zero to 59.5%. In the validation set, the system discriminated disks with papilledema from normal disks and disks with nonpapilledema abnormalities with an AUC of 0.99 (95% confidence interval [CI], 0.98 to 0.99) and normal from abnormal disks with an AUC of 0.99 (95% CI, 0.99 to 0.99). In the external-testing data set of 1505 photographs, the system had an AUC for the detection of papilledema of 0.96 (95% CI, 0.95 to 0.97), a sensitivity of 96.4% (95% CI, 93.9 to 98.3), and a specificity of 84.7% (95% CI, 82.3 to 87.1). CONCLUSIONS A deep-learning system using fundus photographs with pharmacologically dilated pupils differentiated among optic disks with papilledema, normal disks, and disks with nonpapilledema abnormalities.",

author = "Dan Milea and Najjar, {Raymond P.} and Jiang Zhubo and Daniel Ting and Caroline Vasseneix and Xinxing Xu and Fard, {Masoud Aghsaei} and Pedro Fonseca and Kavin Vanikieti and Lagr{\`e}ze, {Wolf A.} and {La Morgia}, Chiara and Cheung, {Carol Y.} and Steffen Hamann and Christophe Chiquet and Nicolae Sanda and Hui Yang and Mejico, {Luis J.} and Rougier, {Marie B{\'e}n{\'e}dicte} and Richard Kho and Chau, {Tran Thi Ha} and Shweta Singhal and Philippe Gohier and Catherine Clermont-Vignal and Cheng, {Ching Yu} and Jonas, {Jost B.} and Patrick Yu-Wai-Man and Fraser, {Clare L.} and Chen, {John J.} and Selvakumar Ambika and Miller, {Neil R.} and Yong Liu and Newman, {Nancy J.} and Wong, {Tien Y.} and Val{\'e}rie Biousse",

note = "Publisher Copyright: Copyright {\textcopyright} 2020 Massachusetts Medical Society.",

year = "2020",

month = apr,

day = "30",

doi = "10.1056/NEJMoa1917130",

language = "English (US)",

volume = "382",

pages = "1687--1695",

journal = "New England Journal of Medicine",

issn = "0028-4793",

publisher = "Massachussetts Medical Society",

number = "18",

}

TY - JOUR

T1 - Artificial intelligence to detect papilledema from ocular fundus photographs

AU - Milea, Dan

AU - Najjar, Raymond P.

AU - Zhubo, Jiang

AU - Ting, Daniel

AU - Vasseneix, Caroline

AU - Xu, Xinxing

AU - Fard, Masoud Aghsaei

AU - Fonseca, Pedro

AU - Vanikieti, Kavin

AU - Lagrèze, Wolf A.

AU - La Morgia, Chiara

AU - Cheung, Carol Y.

AU - Hamann, Steffen

AU - Chiquet, Christophe

AU - Sanda, Nicolae

AU - Yang, Hui

AU - Mejico, Luis J.

AU - Rougier, Marie Bénédicte

AU - Kho, Richard

AU - Chau, Tran Thi Ha

AU - Singhal, Shweta

AU - Gohier, Philippe

AU - Clermont-Vignal, Catherine

AU - Cheng, Ching Yu

AU - Jonas, Jost B.

AU - Yu-Wai-Man, Patrick

AU - Fraser, Clare L.

AU - Chen, John J.

AU - Ambika, Selvakumar

AU - Miller, Neil R.

AU - Liu, Yong

AU - Newman, Nancy J.

AU - Wong, Tien Y.

AU - Biousse, Valérie

PY - 2020/4/30

Y1 - 2020/4/30

N2 - BACKGROUND Nonophthalmologist physicians do not confidently perform direct ophthalmoscopy. The use of artificial intelligence to detect papilledema and other optic-disk abnormalities from fundus photographs has not been well studied. METHODS We trained, validated, and externally tested a deep-learning system to classify optic disks as being normal or having papilledema or other abnormalities from 15,846 retrospectively collected ocular fundus photographs that had been obtained with pharmacologic pupillary dilation and various digital cameras in persons from multiple ethnic populations. Of these photographs, 14,341 from 19 sites in 11 countries were used for training and validation, and 1505 photographs from 5 other sites were used for external testing. Performance at classifying the optic-disk appearance was evaluated by calculating the area under the receiver-operating-characteristic curve (AUC), sensitivity, and specificity, as compared with a reference standard of clinical diagnoses by neuro-ophthalmologists. RESULTS The training and validation data sets from 6779 patients included 14,341 photographs: 9156 of normal disks, 2148 of disks with papilledema, and 3037 of disks with other abnormalities. The percentage classified as being normal ranged across sites from 9.8 to 100%; the percentage classified as having papilledema ranged across sites from zero to 59.5%. In the validation set, the system discriminated disks with papilledema from normal disks and disks with nonpapilledema abnormalities with an AUC of 0.99 (95% confidence interval [CI], 0.98 to 0.99) and normal from abnormal disks with an AUC of 0.99 (95% CI, 0.99 to 0.99). In the external-testing data set of 1505 photographs, the system had an AUC for the detection of papilledema of 0.96 (95% CI, 0.95 to 0.97), a sensitivity of 96.4% (95% CI, 93.9 to 98.3), and a specificity of 84.7% (95% CI, 82.3 to 87.1). CONCLUSIONS A deep-learning system using fundus photographs with pharmacologically dilated pupils differentiated among optic disks with papilledema, normal disks, and disks with nonpapilledema abnormalities.

AB - BACKGROUND Nonophthalmologist physicians do not confidently perform direct ophthalmoscopy. The use of artificial intelligence to detect papilledema and other optic-disk abnormalities from fundus photographs has not been well studied. METHODS We trained, validated, and externally tested a deep-learning system to classify optic disks as being normal or having papilledema or other abnormalities from 15,846 retrospectively collected ocular fundus photographs that had been obtained with pharmacologic pupillary dilation and various digital cameras in persons from multiple ethnic populations. Of these photographs, 14,341 from 19 sites in 11 countries were used for training and validation, and 1505 photographs from 5 other sites were used for external testing. Performance at classifying the optic-disk appearance was evaluated by calculating the area under the receiver-operating-characteristic curve (AUC), sensitivity, and specificity, as compared with a reference standard of clinical diagnoses by neuro-ophthalmologists. RESULTS The training and validation data sets from 6779 patients included 14,341 photographs: 9156 of normal disks, 2148 of disks with papilledema, and 3037 of disks with other abnormalities. The percentage classified as being normal ranged across sites from 9.8 to 100%; the percentage classified as having papilledema ranged across sites from zero to 59.5%. In the validation set, the system discriminated disks with papilledema from normal disks and disks with nonpapilledema abnormalities with an AUC of 0.99 (95% confidence interval [CI], 0.98 to 0.99) and normal from abnormal disks with an AUC of 0.99 (95% CI, 0.99 to 0.99). In the external-testing data set of 1505 photographs, the system had an AUC for the detection of papilledema of 0.96 (95% CI, 0.95 to 0.97), a sensitivity of 96.4% (95% CI, 93.9 to 98.3), and a specificity of 84.7% (95% CI, 82.3 to 87.1). CONCLUSIONS A deep-learning system using fundus photographs with pharmacologically dilated pupils differentiated among optic disks with papilledema, normal disks, and disks with nonpapilledema abnormalities.

UR - http://www.scopus.com/inward/record.url?scp=85084305461&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=85084305461&partnerID=8YFLogxK

U2 - 10.1056/NEJMoa1917130

DO - 10.1056/NEJMoa1917130

M3 - Article

C2 - 32286748

AN - SCOPUS:85084305461

SN - 0028-4793

VL - 382

SP - 1687

EP - 1695

JO - New England Journal of Medicine

JF - New England Journal of Medicine

IS - 18

ER -

Artificial intelligence to detect papilledema from ocular fundus photographs

Abstract

ASJC Scopus subject areas

Access to Document

Other files and links

Fingerprint

Cite this