Improving image segmentation with contextual and structural similarity

Xiaoyang Chen, Qin Liu, Hannah H. Deng, Tianshu Kuang, Henry Hung Ying Lin, Deqiang Xiao, Jaime Gateno, James J. Xia, Pew Thian Yap

Research output: Contribution to journalArticlepeer-review

1 Scopus citations

Abstract

Deep learning models for medical image segmentation are usually trained with voxel-wise losses, e.g., cross-entropy loss, focusing on unary supervision without considering inter-voxel relationships. This oversight potentially leads to semantically inconsistent predictions. Here, we propose a contextual similarity loss (CSL) and a structural similarity loss (SSL) to explicitly and efficiently incorporate inter-voxel relationships for improved performance. The CSL promotes consistency in predicted object categories for each image sub-region compared to ground truth. The SSL enforces compatibility between the predictions of voxel pairs by computing pair-wise distances between them, ensuring that voxels of the same class are close together whereas those from different classes are separated by a wide margin in the distribution space. The effectiveness of the CSL and SSL is evaluated using a clinical cone-beam computed tomography (CBCT) dataset of patients with various craniomaxillofacial (CMF) deformities and a public pancreas dataset. Experimental results show that the CSL and SSL outperform state-of-the-art regional loss functions in preserving segmentation semantics.

Original languageEnglish (US)
Article number110489
JournalPattern Recognition
Volume152
DOIs
StatePublished - Aug 2024

Keywords

  • Cone-beam computed tomography
  • Image segmentation
  • Inter-voxel relationships
  • Pancreas segmentation

ASJC Scopus subject areas

  • Software
  • Signal Processing
  • Computer Vision and Pattern Recognition
  • Artificial Intelligence

Fingerprint

Dive into the research topics of 'Improving image segmentation with contextual and structural similarity'. Together they form a unique fingerprint.

Cite this