TY - JOUR
T1 - Development and validation of a machine learning-based approach to identify high-risk diabetic cardiomyopathy phenotype
AU - Segar, Matthew W.
AU - Usman, Muhammad Shariq
AU - Patel, Kershaw V.
AU - Khan, Muhammad Shahzeb
AU - Butler, Javed
AU - Manjunath, Lakshman
AU - Lam, Carolyn S.P.
AU - Verma, Subodh
AU - Willett, Du Wayne
AU - Kao, David
AU - Januzzi, James L.
AU - Pandey, Ambarish
N1 - Publisher Copyright:
© 2024 The Author(s). European Journal of Heart Failure published by John Wiley & Sons Ltd on behalf of European Society of Cardiology.
PY - 2024/10
Y1 - 2024/10
N2 - Aims: Abnormalities in specific echocardiographic parameters and cardiac biomarkers have been reported among individuals with diabetes. However, a comprehensive characterization of diabetic cardiomyopathy (DbCM), a subclinical stage of myocardial abnormalities that precede the development of clinical heart failure (HF), is lacking. In this study, we developed and validated a machine learning-based clustering approach to identify the high-risk DbCM phenotype based on echocardiographic and cardiac biomarker parameters. Methods and results: Among individuals with diabetes from the Atherosclerosis Risk in Communities (ARIC) cohort who were free of cardiovascular disease and other potential aetiologies of cardiomyopathy (training, n = 1199), unsupervised hierarchical clustering was performed using echocardiographic parameters and cardiac biomarkers of neurohormonal stress and chronic myocardial injury (total 25 variables). The high-risk DbCM phenotype was identified based on the incidence of HF on follow-up. A deep neural network (DeepNN) classifier was developed to predict DbCM in the ARIC training cohort and validated in an external community-based cohort (Cardiovascular Health Study [CHS]; n = 802) and an electronic health record (EHR) cohort (n = 5071). Clustering identified three phenogroups in the derivation cohort. Phenogroup-3 (n = 324, 27% of the cohort) had significantly higher 5-year HF incidence than other phenogroups (12.1% vs. 4.6% [phenogroup 2] vs. 3.1% [phenogroup 1]) and was identified as the high-risk DbCM phenotype. The key echocardiographic predictors of high-risk DbCM phenotype were higher NT-proBNP levels, increased left ventricular mass and left atrial size, and worse diastolic function. In the CHS and University of Texas (UT) Southwestern EHR validation cohorts, the DeepNN classifier identified 16% and 29% of participants with DbCM, respectively. Participants with (vs. without) high-risk DbCM phenotype in the external validation cohorts had a significantly higher incidence of HF (hazard ratio [95% confidence interval] 1.61 [1.18–2.19] in CHS and 1.34 [1.08–1.65] in the UT Southwestern EHR cohort). Conclusion: Machine learning-based techniques may identify 16% to 29% of individuals with diabetes as having a high-risk DbCM phenotype who may benefit from more aggressive implementation of HF preventive strategies.
AB - Aims: Abnormalities in specific echocardiographic parameters and cardiac biomarkers have been reported among individuals with diabetes. However, a comprehensive characterization of diabetic cardiomyopathy (DbCM), a subclinical stage of myocardial abnormalities that precede the development of clinical heart failure (HF), is lacking. In this study, we developed and validated a machine learning-based clustering approach to identify the high-risk DbCM phenotype based on echocardiographic and cardiac biomarker parameters. Methods and results: Among individuals with diabetes from the Atherosclerosis Risk in Communities (ARIC) cohort who were free of cardiovascular disease and other potential aetiologies of cardiomyopathy (training, n = 1199), unsupervised hierarchical clustering was performed using echocardiographic parameters and cardiac biomarkers of neurohormonal stress and chronic myocardial injury (total 25 variables). The high-risk DbCM phenotype was identified based on the incidence of HF on follow-up. A deep neural network (DeepNN) classifier was developed to predict DbCM in the ARIC training cohort and validated in an external community-based cohort (Cardiovascular Health Study [CHS]; n = 802) and an electronic health record (EHR) cohort (n = 5071). Clustering identified three phenogroups in the derivation cohort. Phenogroup-3 (n = 324, 27% of the cohort) had significantly higher 5-year HF incidence than other phenogroups (12.1% vs. 4.6% [phenogroup 2] vs. 3.1% [phenogroup 1]) and was identified as the high-risk DbCM phenotype. The key echocardiographic predictors of high-risk DbCM phenotype were higher NT-proBNP levels, increased left ventricular mass and left atrial size, and worse diastolic function. In the CHS and University of Texas (UT) Southwestern EHR validation cohorts, the DeepNN classifier identified 16% and 29% of participants with DbCM, respectively. Participants with (vs. without) high-risk DbCM phenotype in the external validation cohorts had a significantly higher incidence of HF (hazard ratio [95% confidence interval] 1.61 [1.18–2.19] in CHS and 1.34 [1.08–1.65] in the UT Southwestern EHR cohort). Conclusion: Machine learning-based techniques may identify 16% to 29% of individuals with diabetes as having a high-risk DbCM phenotype who may benefit from more aggressive implementation of HF preventive strategies.
KW - Diabetic cardiomyopathy
KW - Heart failure
KW - Type 2 diabetes mellitus
UR - http://www.scopus.com/inward/record.url?scp=85203257829&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85203257829&partnerID=8YFLogxK
U2 - 10.1002/ejhf.3443
DO - 10.1002/ejhf.3443
M3 - Article
C2 - 39240129
AN - SCOPUS:85203257829
SN - 1388-9842
VL - 26
SP - 2183
EP - 2192
JO - European Journal of Heart Failure
JF - European Journal of Heart Failure
IS - 10
ER -