Brief Review — Channel-UNet: A Spatial Channel-Wise Convolutional Neural Network for Liver and Tumors Segmentation
Channel-UNet, U-Net With Spatial Channel-Wise Convolution
Channel-UNet: A Spatial Channel-Wise Convolutional Neural Network for Liver and Tumors Segmentation,
Channel-UNet, by Chinese Academy of Sciences, Peng Cheng Laboratory, Shenzhen Institutes of Advanced Technology, Wuhan University, and The Chinese University of Hong Kong
2019 J. Frontiers in Genetics, Over 60 Citations (Sik-Ho Tsang @ Medium)
Medical Imaging, Medical Image Analysis, Image Segmentation, U-Net
- Channel-UNet is proposed, which takes U-Net as the main structure of the network with spatial channel-wise convolution added.
1.1. Overall Architecture
- Top: The backbone of the proposed Channel-UNet is the U-Net structure, with spatial channel-wise convolution added.
- Bottom: Each sub-module consists of two branch channels.
- Branch 1 is composed of multiple convolutional layers in series.
- Branch 2 is composed of multiple convolution layers and a spatial channel-wise convolution layer in series, which would extend the receptive field of the spatial channel-wise convolutional layers.
- The two branches are eventually concatenated.
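The two-branch sub-module described above can be sketched in plain Python. This is only a minimal illustration, not the authors' implementation: the number of layers per branch, the ReLU activation, and the use of 1×1 (pointwise) convolutions as stand-ins for the ordinary convolutional layers are all assumptions, and the helper names (`pointwise_conv`, `spatial_channelwise_conv`, `two_branch_module`) are hypothetical.

```python
# A feature map is a list of channels; each channel is a flat list of H*W pixels.
# Every layer choice below is an illustrative assumption, not the paper's setting.

def pointwise_conv(x, weights):
    """Stand-in for an ordinary conv layer: mixes channels at each pixel.

    x: C_in channels of P pixels; weights: C_out rows of C_in values.
    """
    n_pixels = len(x[0])
    return [
        [sum(w * ch[p] for w, ch in zip(row, x)) for p in range(n_pixels)]
        for row in weights
    ]

def spatial_channelwise_conv(x, mixing):
    """Mixes pixel positions within each channel.

    mixing: P rows of P values; output pixel j = sum_i mixing[j][i] * pixel i.
    """
    return [[sum(m * v for m, v in zip(row, ch)) for row in mixing] for ch in x]

def relu(x):
    return [[max(0.0, v) for v in ch] for ch in x]

def two_branch_module(x, branch1_weights, branch2_weights, mixing):
    # Branch 1: convolutional layers in series.
    b1 = x
    for w in branch1_weights:
        b1 = relu(pointwise_conv(b1, w))
    # Branch 2: convolutional layers, then one spatial channel-wise convolution.
    b2 = x
    for w in branch2_weights:
        b2 = relu(pointwise_conv(b2, w))
    b2 = spatial_channelwise_conv(b2, mixing)
    # Concatenate the two branches along the channel axis.
    return b1 + b2

# Toy input: 2 channels of a 2x2 image (4 pixels each).
x = [[1.0, 2.0, 3.0, 4.0], [0.5, 0.5, 0.5, 0.5]]
identity = [[1.0, 0.0], [0.0, 1.0]]           # channel-mixing weights
reverse = [[0, 0, 0, 1], [0, 0, 1, 0],
           [0, 1, 0, 0], [1, 0, 0, 0]]        # pixel-mixing weights
out = two_branch_module(x, [identity, identity], [identity], reverse)
print(len(out))  # 2 channels from each branch
```

With these toy weights, Branch 1 passes the input through unchanged while Branch 2 reorders the pixels inside each channel, so the concatenated output carries both the local view and the pixel-to-pixel view side by side.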
1.2. Spatial Channel-Wise Convolution
- (A) Traditional convolution uses 1×1×N convolution kernels, where N is the number of kernels, which equals the number of output images.
- (B) Spatial channel-wise convolution uses 1×1×32² convolutional kernels (32² = 1024, the number of pixels in a 32×32 image) to calculate spatial channel-wise convolution with three input images (32×32).
- Example: Nine different 1×1 convolution kernels are applied to a 3×3 image whose upper-left and lower-right pixels are 1 and whose other pixels are 0. In this way, the mapping relationship between the upper-left and lower-right pixels can be learned.
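To make the 3×3 example concrete, here is a small sketch of one plausible reading of spatial channel-wise convolution: the pixel positions are treated as channels, and each of the nine kernels produces one output pixel as a weighted sum over all nine input pixels. The function name and the weight values (including the 0.5) are made up for illustration and do not come from the paper.

```python
def spatial_channelwise_conv(image, kernels):
    """Apply one kernel per output pixel; each kernel weights all input pixels.

    image: H x W nested list; kernels: H*W weight vectors of length H*W.
    """
    h, w = len(image), len(image[0])
    flat = [v for row in image for v in row]  # pixel positions act as channels
    out_flat = [sum(k * v for k, v in zip(kernel, flat)) for kernel in kernels]
    return [out_flat[r * w:(r + 1) * w] for r in range(h)]

# 3x3 image: upper-left and lower-right corners are 1, everything else is 0.
image = [[1, 0, 0],
         [0, 0, 0],
         [0, 0, 1]]

# Nine kernels. Start from the identity (each output pixel copies its own
# input pixel), then let the upper-left output also read the lower-right input.
kernels = [[1.0 if i == j else 0.0 for i in range(9)] for j in range(9)]
kernels[0][8] = 0.5  # hypothetical learned weight linking the two corners

out = spatial_channelwise_conv(image, kernels)
print(out[0][0])  # 1.0 * 1 + 0.5 * 1 = 1.5
```

The upper-left output now depends on the lower-right input, something an ordinary 1×1 convolution cannot express because it only combines values at the same pixel position.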
- The 3Dircadb dataset is used.
2.1. Ablation Study
- As the number of convolutional layers stacked in front of the spatial channel-wise convolution layer increases, the Dice value first increases and then decreases.
The Dice value is highest when 3 convolutional layers are used.
The improvement of segmentation accuracy indicates that the information between the pixels on (x, y)-plane extracted by spatial channel-wise convolution is helpful to the recognition of tumors and liver.
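For readers less familiar with the metric, the Dice value measures the overlap between a predicted mask and the ground-truth mask. Below is a minimal sketch using the standard definition, Dice = 2|A∩B| / (|A| + |B|). The paper's exact evaluation protocol (for example, per-case versus global averaging) is not covered here, and returning 1.0 for two empty masks is an assumed convention.

```python
def dice(pred, target):
    """Dice coefficient for two binary masks given as flat lists of 0/1."""
    intersection = sum(p * t for p, t in zip(pred, target))
    total = sum(pred) + sum(target)
    # Assumed convention: two empty masks count as a perfect match.
    return 1.0 if total == 0 else 2.0 * intersection / total

pred = [1, 1, 0, 0, 1, 0]    # predicted mask (3 foreground pixels)
target = [1, 0, 0, 0, 1, 1]  # ground-truth mask (3 foreground pixels)
score = dice(pred, target)   # overlap of 2 pixels: 2 * 2 / (3 + 3)
print(round(score, 3))
```

A value of 1 means the two masks coincide exactly, and 0 means they do not overlap at all.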
2.2. SOTA Comparisons
The Dice values of liver and tumor segmentation by Channel-UNet are 0.984 and 0.940 respectively, outperforming H-DenseUNet (Li et al., 2018b), the best prior method.
2.3. Qualitative Results