Review — Dong’s ICDSP’16: Morphological Analysis and Deep Learning (Camera Anomaly/Tampering Detection)

Morphological Analysis and Deep Learning to Detect Simple and Complicated Camera Anomalies Respectively

5 min readFeb 27, 2021

In this story, Camera Anomaly Detection based on Morphological Analysis and Deep Learning, by Zhejiang University of Technology, and Hebei University of Technology, is reviewed. In this paper:

A morphological analysis and deep learning based camera anomaly detection method are proposed to detect comprehensive types of anomaly.
Morphological analysis (Non deep learning approach) is used to detect simple camera anomalies to accelerate the processing speed.
Deep learning is utilized to detect complicated camera anomalies to improve the accuracy.

This is a paper in 2016 ICDSP. (Sik-Ho Tsang @ Medium) The above figure shows some examples of anomaly to be detected.

Outline

Anomaly Detection based on Morphological Analysis
Anomaly Detection based on Deep Learning
Experimental Results

1. Anomaly Detection based on Morphological Analysis

Morphological analysis based anomaly detection has sufficiently good performance to detect five camera anomalies.

1.1. Brightness Fault (Too Bright or Too Dark)

A brightness threshold L and a darkness threshold D are set.
Then, all grey level of pixels in a video image is compared with L and D.

If the number of pixels with grey level great than L is greater than threshold NL, the video image is identified as too-bright-anomaly.
Similarly, if the number of pixels with grey levels less than D is greater than threshold ND, the video image is identified as too-dark-anomaly.

1.2. Brightness Fault (Gain Disorder)

**YUV color space and U, V-dimensional coordinate diagram**

YUV color space is utilized to detect gain disorder.

As in the figure, the greater the distance from the origin, the deeper the color is. Video image is segmented by setting deep color pixel threshold dcp.

When the number of deep pixels is greater than 1/2 of the number of all pixels, and each partition of coordinate system as shown in the figure and each image partition have uniform number of deep pixels, the current video image is an image with gain disorder.

1.3. Freeze Abnormal

When a certain number of consecutive video images are identical, freeze abnormal appears.

1.4. Lose Abnormal

When a screen picture of a monitoring video is missing, blue or black screen presents.

Therefore, lose abnormal detection is implemented by counting the number of blue screen frames or black screen frames.

1.5. Scrolling Abnormal

In scrolling abnormal, the content change of adjacent image frames is little, but the locations of their corresponding contents are shaking.
If the adjacent image frames are aligned, ghosting phenomenon will occur and their contents will increase.
First, a gradient map of current image is calculated:

where I(x,y) is the pixel value of a pixel (x,y).

If ||fk(x,y)|-|fk+2(x,y)||>Tk, pixel (x,y) is identified as a contour point of the scrolling image region.

For current image, the number of contour points N1 and the number of the edge points N2 are calculated.

If |N2-N1|is greater than 1/10 of the area of current image, the current image frame is identified as scrolling image.

1.6 Color Cast (too green, too blue, too red, too yellow)

The RGB values first are transformed to the XYZ color space, and then the XYZ values are transformed to the Lab color models.

Here, the values of Xn, Yn and Zn are 95.047, 100, and 108.883 respectively.
After Lab values are calculated, image average chrominance D, central moment of image chrominance M.

Color cast factor K=D/M are computed.

Then, color cast is determined according to color cast factor calculation criteria:

**Color cast factor calculation criteria**

2. Anomaly Detection based on Deep Learning

**The architecture of the convolutional neural network (CNN)**

There are some complicated anomalies, such as strip interference, camera occlusion and image blurring, which are difficult to be detected by morphological analysis based detection methods.
The CNN is used to detect such complicated anomalies.
It consists of 4 convolutional layers, 3 pooling layers, 1 fully-connected layer. Softmax loss is used to train a classifier.
(It is not sure but I believe the single current frame is used as input for anomaly detection of the current frame. The number of classes to be predicted or the number of output neurons is also not sure. Not much information about the network is provided in the paper.)

3. Experimental Results

A certain amount of surveillance videos from two real application scenarios is collected.
One scenario is a residence community where the VSS includes 10 16-channel Hikvision DVR surveillance devices and 160 cameras.
Another is a school where the VSS includes 14 16-channel Hikvision DVR surveillance devices and 194 cameras.

**Accuracy-time relation table for residence community test**

**Accuracy-time relation table for school test**

The accuracy of the proposed method in two application scenarios exceeds 95%.

The false alarm rates in various different times of a day are shown above.
False alarms occur when light condition changes relatively greatly.
“detection-train-detection” tactics is applied in which video images leading to false alarms are used as training samples to train a new classifier for the following camera anomaly detection.
As shown in the above tables, the tactics enhances the robustness of the anomaly detection method and prevents the recurrence of the same false alarm.

**Deep learning-based method vs. morphological analysis based method for strip interference and camera occlusion**

As shown above, deep learning based detection method has more excellent accuracy for complicated anomalies.
As time goes by, when enough samples leading to false alarm are added as training samples, the accuracy of our deep learning based detection method can be further improved.

Reference

[2016 ICDSP] [Dong’s ICDSP’16]
Camera Anomaly Detection based on Morphological Analysis and Deep Learning

Camera Tampering Detection

2016 [Dong’s ICDSP’16] 2019 [VFI-ConvLSTM] [UHCTD]