Publications / 2019 Proceedings of the 36th ISARC, Banff, Alberta, Canada

Semantic Segmentation of Sewer Pipe Defects Using Deep Dilated Convolutional Neural Network

Mingzhu Wang and Jack C.P. Cheng
Pages 586-594 (2019 Proceedings of the 36th ISARC, Banff, Alberta, Canada, ISBN 978-952-69524-0-6)

Semantic segmentation of closed-circuit television (CCTV) images can facilitate automatic severity assessment of sewer pipe defects by assigning defect labels to each pixel in the image, from which defect types, locations and geometric information can be obtained. In this study, a deep convolutional neural network (CNN), namely DilaSeg, is developed based on dilated convolution for improving the segmentation of sewer pipe defects including cracks, tree root intrusion and deposit. Sewer pipe CCTV images are extracted from inspection videos and are annotated to be used as the ground truth labels for training the model. DilaSeg is constructed with dilated convolution for producing feature maps with high resolution. Both DilaSeg and the state-of-the-art model, fully convolutional network (FCN), are trained and evaluated on the annotated dataset using the same hyper-parameters. The results of the experiments indicate that the proposed DilaSeg improved the segmentation accuracy significantly compared with FCN, with 18% of increase in mean pixel accuracy (mPA) and 22% of increase in mean intersection over union (IoU) with a fast detection speed.

Keywords: Dilated convolution; Convolutional neural network (CNN); Semantic segmentation; Sewer pipe defect; Defect segmentation; Visual inspection