TY - JOUR
T1 - Commonalities across computational workflows for uncovering explanatory variants in undiagnosed cases
AU - Undiagnosed Diseases Network
AU - Kobren, Shilpa Nadimpalli
AU - Baldridge, Dustin
AU - Velinder, Matt
AU - Krier, Joel B.
AU - LeBlanc, Kimberly
AU - Esteves, Cecilia
AU - Pusey, Barbara N.
AU - Züchner, Stephan
AU - Blue, Elizabeth
AU - Lee, Hane
AU - Huang, Alden
AU - Bastarache, Lisa
AU - Bican, Anna
AU - Cogan, Joy
AU - Marwaha, Shruti
AU - Alkelai, Anna
AU - Murdock, David R.
AU - Liu, Pengfei
AU - Wegner, Daniel J.
AU - Paul, Alexander J.
AU - Acosta, Maria T.
AU - Adam, Margaret
AU - Adams, David R.
AU - Agrawal, Pankaj B.
AU - Alejandro, Mercedes E.
AU - Alvey, Justin
AU - Amendola, Laura
AU - Andrews, Ashley
AU - Ashley, Euan A.
AU - Azamian, Mahshid S.
AU - Bacino, Carlos A.
AU - Bademci, Guney
AU - Baker, Eva
AU - Balasubramanyam, Ashok
AU - Bale, Jim
AU - Bamshad, Michael
AU - Barbouth, Deborah
AU - Bayrak-Toydemir, Pinar
AU - Beck, Anita
AU - Beggs, Alan H.
AU - Behrens, Edward
AU - Bejerano, Gill
AU - Bennett, Jimmy
AU - Berg-Rood, Beverly
AU - Bernstein, Jonathan A.
AU - Dasari, Surendra
AU - Lanpher, Brendan C.
AU - Lanza, Ian R.
AU - Morava, Eva
AU - Oglesbee, Devin
N1 - Funding Information:
Thank you to the UDN Tool Building Coalition for discussions about tools in use or under development, to Daniel Traviglia for clarifications on UDN data availability, and to Rebecca Reimers for writing feedback. Research reported here was supported by the NIH Common Fund, through the Office of Strategic Coordination/Office of the NIH Director under award numbers U01HG007530, U01HG007942, U01HG007672, U01HG007690, U01HG010218, U01HG007703, U01HG010230, U01HG010217, U01HG010233, U01HG007674, and U01HG010215, and by the Intramural Research Program of the National Human Genome Research Institute. The content is solely the responsibility of the authors and does not necessarily represent the official views of the National Institutes of Health.
Publisher Copyright:
© 2021, The Author(s).
PY - 2021/6/1
Y1 - 2021/6/1
N2 - Purpose: Genomic sequencing has become an increasingly powerful and relevant tool to be leveraged for the discovery of genetic aberrations underlying rare, Mendelian conditions. Although the computational tools incorporated into diagnostic workflows for this task are continually evolving and improving, we nevertheless sought to investigate commonalities across sequencing processing workflows to reveal consensus and standard practice tools and highlight exploratory analyses where technical and theoretical method improvements would be most impactful. Methods: We collected details regarding the computational approaches used by a genetic testing laboratory and 11 clinical research sites in the United States participating in the Undiagnosed Diseases Network via meetings with bioinformaticians, online survey forms, and analyses of internal protocols. Results: We found that tools for processing genomic sequencing data can be grouped into four distinct categories. Whereas well-established practices exist for initial variant calling and quality control steps, there is substantial divergence across sites in later stages for variant prioritization and multimodal data integration, demonstrating a diversity of approaches for solving the most mysterious undiagnosed cases. Conclusion: The largest differences across diagnostic workflows suggest that advances in structural variant detection, noncoding variant interpretation, and integration of additional biomedical data may be especially promising for solving chronically undiagnosed cases.
AB - Purpose: Genomic sequencing has become an increasingly powerful and relevant tool to be leveraged for the discovery of genetic aberrations underlying rare, Mendelian conditions. Although the computational tools incorporated into diagnostic workflows for this task are continually evolving and improving, we nevertheless sought to investigate commonalities across sequencing processing workflows to reveal consensus and standard practice tools and highlight exploratory analyses where technical and theoretical method improvements would be most impactful. Methods: We collected details regarding the computational approaches used by a genetic testing laboratory and 11 clinical research sites in the United States participating in the Undiagnosed Diseases Network via meetings with bioinformaticians, online survey forms, and analyses of internal protocols. Results: We found that tools for processing genomic sequencing data can be grouped into four distinct categories. Whereas well-established practices exist for initial variant calling and quality control steps, there is substantial divergence across sites in later stages for variant prioritization and multimodal data integration, demonstrating a diversity of approaches for solving the most mysterious undiagnosed cases. Conclusion: The largest differences across diagnostic workflows suggest that advances in structural variant detection, noncoding variant interpretation, and integration of additional biomedical data may be especially promising for solving chronically undiagnosed cases.
UR - http://www.scopus.com/inward/record.url?scp=85100955668&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85100955668&partnerID=8YFLogxK
U2 - 10.1038/s41436-020-01084-8
DO - 10.1038/s41436-020-01084-8
M3 - Article
C2 - 33580225
AN - SCOPUS:85100955668
SN - 1098-3600
VL - 23
SP - 1075
EP - 1085
JO - Genetics in Medicine
JF - Genetics in Medicine
IS - 6
ER -