TY - GEN
T1 - Human-Guided Feature Selection for Accurate Cardiomyocyte Dysfunction Classification
AU - Mehdi, Rana Raza
AU - Sahoo, Sukanya
AU - Neelakantan, Sunder
AU - Mendiola, Emilio A.
AU - Myers, Kyle
AU - Sadayappan, Sakthivel
AU - Avazmohammadi, Reza
N1 - Publisher Copyright:
© 2025 IEEE.
PY - 2025
Y1 - 2025
N2 - Early identification of cardiomyocyte dysfunction is a critical challenge for the prognosis of diastolic heart failure (DHF) exhibiting impaired left ventricular relaxation (ILVR). Myocardial relaxation relies strongly on efficient intracellular calcium (Ca2+) handling. During diastole, a sluggish removal of Ca2+ from cardiomyocytes disrupts sarcomere relaxation, leading to ILVR at the organ level. Characterizing myocardial relaxation at the cellular level requires analyzing both sarcomere length (SL) transients and intracellular calcium kinetics (CK). However, due to the complexity and redundancy in SL and CK data, identifying the most informative features for accurate classification is challenging. To address this, we developed a robust feature selection pipeline involving statistical significance testing (p-values), hierarchical clustering, and feature importance evaluation using random forest (RF) classification to select the most informative features from SL and CK data. SL and CK transients were obtained from prior studies involving a transgenic phospho-ablated mouse model exhibiting ILVR (AAA mice) and wild-type as non-transgenic control mice (NTG). By iteratively refining the feature set, we trained a RF classifier using the selected reduced features. For comparison, we evaluated the performance of the classifier using the full set of original features as well as a dimensionally reduced set derived through principal component analysis (PCA). The confusion matrices demonstrated that the reduced feature set achieved comparable performance to the full feature set and outperformed the PCA-based approach, while offering better interpretability by retaining biologically relevant features. These findings suggest that a small, carefully chosen set of biological features can effectively detect early signs of cardiomyocyte dysfunction.Clinical relevance - The proposed feature selection approach facilitates detecting cardiomyocyte dysfunction at an earlier stage, offering clinicians precise, interpretable insights to support faster diagnosis and intervention decisions in conditions like diastolic dysfunction.
AB - Early identification of cardiomyocyte dysfunction is a critical challenge for the prognosis of diastolic heart failure (DHF) exhibiting impaired left ventricular relaxation (ILVR). Myocardial relaxation relies strongly on efficient intracellular calcium (Ca2+) handling. During diastole, a sluggish removal of Ca2+ from cardiomyocytes disrupts sarcomere relaxation, leading to ILVR at the organ level. Characterizing myocardial relaxation at the cellular level requires analyzing both sarcomere length (SL) transients and intracellular calcium kinetics (CK). However, due to the complexity and redundancy in SL and CK data, identifying the most informative features for accurate classification is challenging. To address this, we developed a robust feature selection pipeline involving statistical significance testing (p-values), hierarchical clustering, and feature importance evaluation using random forest (RF) classification to select the most informative features from SL and CK data. SL and CK transients were obtained from prior studies involving a transgenic phospho-ablated mouse model exhibiting ILVR (AAA mice) and wild-type as non-transgenic control mice (NTG). By iteratively refining the feature set, we trained a RF classifier using the selected reduced features. For comparison, we evaluated the performance of the classifier using the full set of original features as well as a dimensionally reduced set derived through principal component analysis (PCA). The confusion matrices demonstrated that the reduced feature set achieved comparable performance to the full feature set and outperformed the PCA-based approach, while offering better interpretability by retaining biologically relevant features. These findings suggest that a small, carefully chosen set of biological features can effectively detect early signs of cardiomyocyte dysfunction.Clinical relevance - The proposed feature selection approach facilitates detecting cardiomyocyte dysfunction at an earlier stage, offering clinicians precise, interpretable insights to support faster diagnosis and intervention decisions in conditions like diastolic dysfunction.
UR - https://www.scopus.com/pages/publications/105023716117
UR - https://www.scopus.com/inward/citedby.url?scp=105023716117&partnerID=8YFLogxK
U2 - 10.1109/EMBC58623.2025.11253736
DO - 10.1109/EMBC58623.2025.11253736
M3 - Conference contribution
C2 - 41336955
AN - SCOPUS:105023716117
T3 - Proceedings of the Annual International Conference of the IEEE Engineering in Medicine and Biology Society, EMBS
BT - 2025 47th Annual International Conference of the IEEE Engineering in Medicine and Biology Society, EMBC 2025 - Proceedings
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 47th Annual International Conference of the IEEE Engineering in Medicine and Biology Society, EMBC 2025
Y2 - 14 July 2025 through 18 July 2025
ER -