Review: Li ICME’17 — CNN for JPEG Compressed Image Deblocking (Codec Filtering)
With Deep CNN, Outperforms ARCNN
In this story, a deep convolutional neural network (CNN) for compressed image deblocking, by Fudan University, is briefly reviewed. With the use of CNN, the decoded/reconstructed image can obtain higher PSNR. This is the paper in 2017 ICME. (Sik-Ho Tsang @ Medium)
Outline
- Network Architecture
- Experimental Results
1. Network Architecture
- The proposed model consists of 20 convolutional layers with taking Rectified Linear Unit as activation function. Each convolutional layer has 32 feature maps and the size of the convolution kernel is 5 × 5.
- Short skip connections and a long skip connection are used as shown above, inspired by ResNet.
After Decoding the JPEG as raw image, the decoded image goes through the proposed network for deblocking to obtain better image quality.
- The proposed network converges after about 2 × 10⁶ backprops.
- However, we can see that AR-CNN and SRCNN both converge after about 4.5 × 10⁸ backprops.
2. Experimental Results
- The proposed network obtains the highest PSNR, SSIM and PSNR-B.
- It outperforms ARCNN should be due to the reason of using a deeper network compared with ARCNN.
- The proposed network can have sharper (less blurry) images.
Reference
[2017 ICME] [Li ICME’17]
An Efficient Deep Convolutional Neural Networks Model for Compressed Image Deblocking
My Previous Reviews
Image Classification [LeNet] [AlexNet] [Maxout] [NIN] [ZFNet] [VGGNet] [Highway] [SPPNet] [PReLU-Net] [STN] [DeepImage] [SqueezeNet] [GoogLeNet / Inception-v1] [BN-Inception / Inception-v2] [Inception-v3] [Inception-v4] [Xception] [MobileNetV1] [ResNet] [Pre-Activation ResNet] [RiR] [RoR] [Stochastic Depth] [WRN] [Shake-Shake] [FractalNet] [Trimps-Soushen] [PolyNet] [ResNeXt] [DenseNet] [PyramidNet] [DRN] [DPN] [Residual Attention Network] [DMRNet / DFN-MR] [IGCNet / IGCV1] [MSDNet] [ShuffleNet V1] [SENet] [NASNet] [MobileNetV2]
Object Detection [OverFeat] [R-CNN] [Fast R-CNN] [Faster R-CNN] [MR-CNN & S-CNN] [DeepID-Net] [CRAFT] [R-FCN] [ION] [MultiPathNet] [NoC] [Hikvision] [GBD-Net / GBD-v1 & GBD-v2] [G-RMI] [TDM] [SSD] [DSSD] [YOLOv1] [YOLOv2 / YOLO9000] [YOLOv3] [FPN] [RetinaNet] [DCN]
Semantic Segmentation [FCN] [DeconvNet] [DeepLabv1 & DeepLabv2] [CRF-RNN] [SegNet] [ParseNet] [DilatedNet] [DRN] [RefineNet] [GCN] [PSPNet] [DeepLabv3] [LC] [FC-DenseNet] [IDW-CNN] [SDN]
Biomedical Image Segmentation [CUMedVision1] [CUMedVision2 / DCAN] [U-Net] [CFS-FCN] [U-Net+ResNet] [MultiChannel] [V-Net] [3D U-Net] [M²FCN] [SA] [QSA+QNT] [3D U-Net+ResNet]
Instance Segmentation [SDS] [Hypercolumn] [DeepMask] [SharpMask] [MultiPathNet] [MNC] [InstanceFCN] [FCIS]
Super Resolution [SRCNN] [FSRCNN] [VDSR] [ESPCN] [RED-Net] [DRCN] [DRRN] [LapSRN & MS-LapSRN] [SRDenseNet]
Human Pose Estimation [DeepPose] [Tompson NIPS’14] [Tompson CVPR’15] [CPM]
Codec Post-Processing [ARCNN] [Li ICME’17]