TY - JOUR
T1 - Mixed-UNet
T2 - Refined class activation mapping for weakly-supervised semantic segmentation with multi-scale inference
AU - Liu, Yang
AU - Lian, Lijin
AU - Zhang, Ersi
AU - Xu, Lulu
AU - Xiao, Chufan
AU - Zhong, Xiaoyun
AU - Li, Fang
AU - Jiang, Bin
AU - Dong, Yuhan
AU - Ma, Lan
AU - Huang, Qiming
AU - Xu, Ming
AU - Zhang, Yongbing
AU - Yu, Dongmei
AU - Yan, Chenggang
AU - Qin, Peiwu
N1 - Publisher Copyright:
Copyright © 2022 Liu, Lian, Zhang, Xu, Xiao, Zhong, Li, Jiang, Dong, Ma, Huang, Xu, Zhang, Yu, Yan and Qin.
PY - 2022/11/8
Y1 - 2022/11/8
N2 - Deep learning techniques have shown great potential in medical image processing, particularly through accurate and reliable image segmentation on magnetic resonance imaging (MRI) scans or computed tomography (CT) scans, which allow the localization and diagnosis of lesions. However, training these segmentation models requires a large number of manually annotated pixel-level labels, which are time-consuming and labor-intensive, in contrast to image-level labels that are easier to obtain. It is imperative to resolve this problem through weakly-supervised semantic segmentation models using image-level labels as supervision since it can significantly reduce human annotation efforts. Most of the advanced solutions exploit class activation mapping (CAM). However, the original CAMs rarely capture the precise boundaries of lesions. In this study, we propose the strategy of multi-scale inference to refine CAMs by reducing the detail loss in single-scale reasoning. For segmentation, we develop a novel model named Mixed-UNet, which has two parallel branches in the decoding phase. The results can be obtained after fusing the extracted features from two branches. We evaluate the designed Mixed-UNet against several prevalent deep learning-based segmentation approaches on our dataset collected from the local hospital and public datasets. The validation results demonstrate that our model surpasses available methods under the same supervision level in the segmentation of various lesions from brain imaging.
AB - Deep learning techniques have shown great potential in medical image processing, particularly through accurate and reliable image segmentation on magnetic resonance imaging (MRI) scans or computed tomography (CT) scans, which allow the localization and diagnosis of lesions. However, training these segmentation models requires a large number of manually annotated pixel-level labels, which are time-consuming and labor-intensive, in contrast to image-level labels that are easier to obtain. It is imperative to resolve this problem through weakly-supervised semantic segmentation models using image-level labels as supervision since it can significantly reduce human annotation efforts. Most of the advanced solutions exploit class activation mapping (CAM). However, the original CAMs rarely capture the precise boundaries of lesions. In this study, we propose the strategy of multi-scale inference to refine CAMs by reducing the detail loss in single-scale reasoning. For segmentation, we develop a novel model named Mixed-UNet, which has two parallel branches in the decoding phase. The results can be obtained after fusing the extracted features from two branches. We evaluate the designed Mixed-UNet against several prevalent deep learning-based segmentation approaches on our dataset collected from the local hospital and public datasets. The validation results demonstrate that our model surpasses available methods under the same supervision level in the segmentation of various lesions from brain imaging.
KW - Mixed-UNet
KW - acute cerebral infarction
KW - conditional random field
KW - multi-scale class activation mapping
KW - weakly-supervised semantic segmentation
UR - http://www.scopus.com/inward/record.url?scp=85142293787&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85142293787&partnerID=8YFLogxK
U2 - 10.3389/fcomp.2022.1036934
DO - 10.3389/fcomp.2022.1036934
M3 - Article
AN - SCOPUS:85142293787
SN - 2624-9898
VL - 4
JO - Frontiers in Computer Science
JF - Frontiers in Computer Science
M1 - 1036934
ER -