Machine Learning Models to Forecast Outcomes of Pituitary Surgery: A Systematic Review in Quality of Reporting and Current Evidence

Matheus M. Rech; Leonardo de Macedo Filho; Alexandra J. White; Carlos Perez-Vega; Susan L. Samson; Kaisorn L. Chaichana; Osarenoma U. Olomu; Alfredo Quinones-Hinojosa; Joao Paulo Almeida

doi:10.3390/brainsci13030495

Neurosurgery

Research output: Contribution to journal › Review article › peer-review

Abstract

Background: The complex nature and heterogeneity involving pituitary surgery results have increased interest in machine learning (ML) applications for prediction of outcomes over the last decade. This study aims to systematically review the characteristics of ML models involving pituitary surgery outcome prediction and assess their reporting quality.

Methods: We searched the PubMed, Scopus, and Web of Knowledge databases for publications on the use of ML to predict pituitary surgery outcomes. We used the Transparent Reporting of a multivariable prediction model for Individual Prognosis Or Diagnosis (TRIPOD) to assess report quality. Our search strategy was based on the terms “artificial intelligence”, “machine learning”, and “pituitary”.

Results: 20 studies were included in this review. The principal models reported in each article were post-surgical endocrine outcomes (n = 10), tumor management (n = 3), and intra- and postoperative complications (n = 7). Overall, the included studies adhered to a median of 65% (IQR = 60–72%) of TRIPOD criteria, ranging from 43% to 83%. The median reported AUC was 0.84 (IQR = 0.80–0.91). The most popular algorithms were support vector machine (n = 5) and random forest (n = 5). Only two studies reported external validation and adherence to any reporting guideline. Calibration methods were not reported in 15 studies. No model achieved the phase of actual clinical applicability.

Conclusion: Applications of ML in the prediction of pituitary outcomes are still nascent, as evidenced by the lack of any model validated for clinical practice. Although studies have demonstrated promising results, greater transparency in model development and reporting is needed to enable their use in clinical practice. Further adherence to reporting guidelines can help increase AI’s real-world utility and improve clinical practice.

Original language	English (US)
Article number	495
Journal	Brain Sciences
Volume	13
Issue number	3
DOIs	https://doi.org/10.3390/brainsci13030495
State	Published - Mar 2023

Keywords

Cushing disease
acromegaly
adenoma
artificial intelligence
machine learning
outcomes
pituitary adenoma
reporting quality assessment
systematic review

ASJC Scopus subject areas

General Neuroscience

Cite this

@article{67dc90c6a6ca4920b96f12ef50a57b5c,

title = "Machine Learning Models to Forecast Outcomes of Pituitary Surgery: A Systematic Review in Quality of Reporting and Current Evidence",

abstract = "Background: The complex nature and heterogeneity involving pituitary surgery results have increased interest in machine learning (ML) applications for prediction of outcomes over the last decade. This study aims to systematically review the characteristics of ML models involving pituitary surgery outcome prediction and assess their reporting quality. Methods: We searched the PubMed, Scopus, and Web of Knowledge databases for publications on the use of ML to predict pituitary surgery outcomes. We used the Transparent Reporting of a multivariable prediction model for Individual Prognosis Or Diagnosis (TRIPOD) to assess report quality. Our search strategy was based on the terms “artificial intelligence”, “machine learning”, and “pituitary”. Results: 20 studies were included in this review. The principal models reported in each article were post-surgical endocrine outcomes (n = 10), tumor management (n = 3), and intra- and postoperative complications (n = 7). Overall, the included studies adhered to a median of 65% (IQR = 60–72%) of TRIPOD criteria, ranging from 43% to 83%. The median reported AUC was 0.84 (IQR = 0.80–0.91). The most popular algorithms were support vector machine (n = 5) and random forest (n = 5). Only two studies reported external validation and adherence to any reporting guideline. Calibration methods were not reported in 15 studies. No model achieved the phase of actual clinical applicability. Conclusion: Applications of ML in the prediction of pituitary outcomes are still nascent, as evidenced by the lack of any model validated for clinical practice. Although studies have demonstrated promising results, greater transparency in model development and reporting is needed to enable their use in clinical practice. Further adherence to reporting guidelines can help increase AI{\textquoteright}s real-world utility and improve clinical practice.",

keywords = "Cushing disease, acromegaly, adenoma, artificial intelligence, machine learning, outcomes, pituitary adenoma, reporting quality assessment, systematic review",

author = "Rech, {Matheus M.} and {de Macedo Filho}, Leonardo and White, {Alexandra J.} and Perez-Vega, Carlos and Samson, {Susan L.} and Chaichana, {Kaisorn L.} and Olomu, {Osarenoma U.} and Quinones-Hinojosa, Alfredo and Almeida, {Joao Paulo}",

note = "Publisher Copyright: {\textcopyright} 2023 by the authors.",

year = "2023",

month = mar,

doi = "10.3390/brainsci13030495",

language = "English (US)",

volume = "13",

journal = "Brain Sciences",

issn = "2076-3425",

publisher = "Multidisciplinary Digital Publishing Institute (MDPI)",

number = "3",

}

TY - JOUR

T1 - Machine Learning Models to Forecast Outcomes of Pituitary Surgery

T2 - A Systematic Review in Quality of Reporting and Current Evidence

AU - Rech, Matheus M.

AU - de Macedo Filho, Leonardo

AU - White, Alexandra J.

AU - Perez-Vega, Carlos

AU - Samson, Susan L.

AU - Chaichana, Kaisorn L.

AU - Olomu, Osarenoma U.

AU - Quinones-Hinojosa, Alfredo

AU - Almeida, Joao Paulo

PY - 2023/3

Y1 - 2023/3

AB - Background: The complex nature and heterogeneity involving pituitary surgery results have increased interest in machine learning (ML) applications for prediction of outcomes over the last decade. This study aims to systematically review the characteristics of ML models involving pituitary surgery outcome prediction and assess their reporting quality. Methods: We searched the PubMed, Scopus, and Web of Knowledge databases for publications on the use of ML to predict pituitary surgery outcomes. We used the Transparent Reporting of a multivariable prediction model for Individual Prognosis Or Diagnosis (TRIPOD) to assess report quality. Our search strategy was based on the terms “artificial intelligence”, “machine learning”, and “pituitary”. Results: 20 studies were included in this review. The principal models reported in each article were post-surgical endocrine outcomes (n = 10), tumor management (n = 3), and intra- and postoperative complications (n = 7). Overall, the included studies adhered to a median of 65% (IQR = 60–72%) of TRIPOD criteria, ranging from 43% to 83%. The median reported AUC was 0.84 (IQR = 0.80–0.91). The most popular algorithms were support vector machine (n = 5) and random forest (n = 5). Only two studies reported external validation and adherence to any reporting guideline. Calibration methods were not reported in 15 studies. No model achieved the phase of actual clinical applicability. Conclusion: Applications of ML in the prediction of pituitary outcomes are still nascent, as evidenced by the lack of any model validated for clinical practice. Although studies have demonstrated promising results, greater transparency in model development and reporting is needed to enable their use in clinical practice. Further adherence to reporting guidelines can help increase AI’s real-world utility and improve clinical practice.

KW - Cushing disease

KW - acromegaly

KW - adenoma

KW - artificial intelligence

KW - machine learning

KW - outcomes

KW - pituitary adenoma

KW - reporting quality assessment

KW - systematic review

UR - http://www.scopus.com/inward/record.url?scp=85151241650&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=85151241650&partnerID=8YFLogxK

U2 - 10.3390/brainsci13030495

DO - 10.3390/brainsci13030495

M3 - Review article

AN - SCOPUS:85151241650

SN - 2076-3425

VL - 13

JO - Brain Sciences

JF - Brain Sciences

IS - 3

M1 - 495

ER -
