A survey of transformers and large language models for ECG diagnosis: advances, challenges, and future directions

Mohammed Yusuf Ansari, Mohammed Yaqoob, Mohammed Ishaq, Eduardo Feo Flushing, Iffa Afsa changaai Mangalote, Sarada Prasad Dakua, Omar Aboumarzouk, Raffaella Righetti, Marwa Qaraqe

Research output: Contribution to journal › Article › peer-review

Abstract

Electrocardiograms (ECGs) are widely utilized in clinical practice as a non-invasive diagnostic tool for detecting cardiovascular diseases. Convolutional neural networks (CNNs) have been the primary choice for ECG analysis due to their capability to process raw signals. However, their localized convolutional operations limit the ability to capture long-range temporal dependencies across heartbeats, impeding a comprehensive cardiovascular assessment. To address these limitations, transformer-based frameworks have been introduced, employing self-attention mechanisms to effectively model complex temporal patterns over entire ECG sequences. Recent advancements in large language models (LLMs) have further expanded the utility of transformers by enabling multimodal integration and facilitating zero-shot diagnosis, thereby enhancing the scope of ECG-based clinical applications. Despite the increasing adoption of these methodologies, a comprehensive survey systematically examining transformer and LLM-based approaches for ECG analysis is absent from the literature. Consequently, this article surveys existing methods and proposes a novel hierarchical taxonomy based on the complexity of diagnosis, ranging from single-beat analysis to multi-beat and full-length signal evaluations. A thorough cross-category comparison is performed to highlight overarching commonalities and limitations. In light of these limitations, the paper presents a discussion of critical gaps and introduces new future directions aimed at improving ECG representation, enhancing positional encodings, refining self-attention architectures, and addressing challenges related to hallucinations and confidence measures in LLMs. The insights and guidelines presented aim to inform future research and clinical practices, enabling the next generation of intelligent ECG diagnostic systems.

Original language: English (US)
Article number: 261
Journal: Artificial Intelligence Review
Volume: 58
Issue number: 9
State: E-pub ahead of print - Jun 4 2025

Keywords

  • Arrhythmia
  • ECG representation
  • Hallucination
  • Large language models
  • Myocardial infarction
  • Positional encoding
  • Self-attention architecture
  • Single-beat and multi-beat analysis
  • Sleep apnea
  • Zero-shot diagnosis

ASJC Scopus subject areas

  • Language and Linguistics
  • Linguistics and Language
  • Artificial Intelligence
