TY - JOUR
T1 - Systems biology analysis of human genomes points to key pathways conferring spina bifida risk
AU - Aguiar-Pulido, Vanessa
AU - Wolujewicz, Paul
AU - Martinez-Fundichely, Alexander
AU - Elhaik, Eran
AU - Thareja, Gaurav
AU - Aleem, Alice Abdel
AU - Chalhoub, Nader
AU - Cuykendall, Tawny
AU - Al-Zamer, Jamel
AU - Lei, Yunping
AU - El-Bashir, Haitham
AU - Musser, James M.
AU - Al-Kaabi, Abdulla
AU - Shaw, Gary M.
AU - Khurana, Ekta
AU - Suhre, Karsten
AU - Mason, Christopher E.
AU - Elemento, Olivier
AU - Finnell, Richard H.
AU - Elizabeth Ross, M.
N1 - Funding Information:
ACKNOWLEDGMENTS. We thank Ms. Amira Assad, Project Coordinator at Weill Cornell Medicine-Qatar, for invaluable efforts toward patient enrollment. We thank the California Department of Public Health Maternal Child and Adolescent Health Division for providing data. The findings and conclusions herein are ours and do not necessarily represent the official position of the California Department of Public Health. This project was supported by the NIH (Grants P01HD067244, R01NS076465, R01HD081216, and T32HD060600) and the Qatar Foundation (Grant NPRP4-149-3-049; the Biomedical Research Program at Weill Cornell Medicine-Qatar).
Publisher Copyright:
© 2021 National Academy of Sciences. All rights reserved.
PY - 2021/12/21
Y1 - 2021/12/21
N2 - Spina bifida (SB) is a debilitating birth defect caused by multiple gene and environment interactions. Though SB shows non-Mendelian inheritance, genetic factors contribute to an estimated 70% of cases. Nevertheless, identifying human mutations conferring SB risk is challenging due to its relative rarity, genetic heterogeneity, incomplete penetrance, and environmental influences that hamper genome-wide association studies approaches to untargeted discovery. Thus, SB genetic studies may suffer from population substructure and/or selection bias introduced by typical candidate gene searches. We report a population based, ancestry-matched whole-genome sequence analysis of SB genetic predisposition using a systems biology strategy to interrogate 298 case-control subject genomes (149 pairs). Genes that were enriched in likely gene disrupting (LGD), rare protein-coding variants were subjected to machine learning analysis to identify genes in which LGD variants occur with a different frequency in cases versus controls and so discriminate between these groups. Those genes with high discriminatory potential for SB significantly enriched pathways pertaining to carbon metabolism, inflammation, innate immunity, cytoskeletal regulation, and essential transcriptional regulation consistent with their having impact on the pathogenesis of human SB. Additionally, an interrogation of conserved noncoding sequences identified robust variant enrichment in regulatory regions of several transcription factors critical to embryonic development. This genome-wide perspective offers an effective approach to the interrogation of coding and noncoding sequence variant contributions to rare complex genetic disorders.
AB - Spina bifida (SB) is a debilitating birth defect caused by multiple gene and environment interactions. Though SB shows non-Mendelian inheritance, genetic factors contribute to an estimated 70% of cases. Nevertheless, identifying human mutations conferring SB risk is challenging due to its relative rarity, genetic heterogeneity, incomplete penetrance, and environmental influences that hamper genome-wide association studies approaches to untargeted discovery. Thus, SB genetic studies may suffer from population substructure and/or selection bias introduced by typical candidate gene searches. We report a population based, ancestry-matched whole-genome sequence analysis of SB genetic predisposition using a systems biology strategy to interrogate 298 case-control subject genomes (149 pairs). Genes that were enriched in likely gene disrupting (LGD), rare protein-coding variants were subjected to machine learning analysis to identify genes in which LGD variants occur with a different frequency in cases versus controls and so discriminate between these groups. Those genes with high discriminatory potential for SB significantly enriched pathways pertaining to carbon metabolism, inflammation, innate immunity, cytoskeletal regulation, and essential transcriptional regulation consistent with their having impact on the pathogenesis of human SB. Additionally, an interrogation of conserved noncoding sequences identified robust variant enrichment in regulatory regions of several transcription factors critical to embryonic development. This genome-wide perspective offers an effective approach to the interrogation of coding and noncoding sequence variant contributions to rare complex genetic disorders.
KW - Myelomeningocele
KW - Neural tube defects
KW - Pathway analysis
KW - Rare variant enrichment
KW - Whole-genome sequence
UR - http://www.scopus.com/inward/record.url?scp=85122629178&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85122629178&partnerID=8YFLogxK
U2 - 10.1073/pnas.2106844118
DO - 10.1073/pnas.2106844118
M3 - Article
C2 - 34916285
AN - SCOPUS:85122629178
VL - 118
JO - Proceedings of the National Academy of Sciences of the United States of America
JF - Proceedings of the National Academy of Sciences of the United States of America
SN - 0027-8424
IS - 51
M1 - e2106844118
ER -