Smoothed dilated convolutions for improved dense prediction

Bibliographic Details
Published in	Data Mining and Knowledge Discovery, Vol. 35, no. 4, pp. 1470-1496
Main Authors	Wang, Zhengyang; Ji, Shuiwang
Format	Journal Article
Language	English
Published	New York: Springer US, 01.07.2021; Springer Nature B.V.
Subjects	Artificial Intelligence; Artificial neural networks; Chemistry and Earth Sciences; Computer Science; Convolution; Data Mining and Knowledge Discovery; Decomposition; Deep learning; Dilated convolutions; Gridding artifacts; Information Storage and Retrieval; Neural networks; Parameters; Performance enhancement; Physics; Semantics; Smoothing; Statistics for Engineering; Training
Summary:	Dilated convolutions, also known as atrous convolutions, have been widely explored in deep convolutional neural networks (DCNNs) for various dense prediction tasks. However, dilated convolutions suffer from gridding artifacts, which hamper performance. In this work, we propose two simple yet effective degridding methods by studying a decomposition of dilated convolutions. Unlike existing models, which explore solutions by focusing on a block of cascaded dilated convolutional layers, our methods address the gridding artifacts by smoothing the dilated convolution itself. In addition, we point out that the two degridding approaches are intrinsically related and define separable and shared (SS) operations, which generalize the proposed methods. We further explore SS operations in view of operations on graphs and propose the SS output layer, which is able to smooth an entire DCNN by replacing only the output layer. We evaluate our degridding methods and the SS output layer thoroughly, and visualize the smoothing effect through effective receptive field analysis. Results show that our degridding methods yield consistent improvements in the performance of dense prediction tasks while adding a negligible number of extra training parameters. In addition, the SS output layer improves performance by 3.3% and contains only 9% of the training parameters of the original output layer.
ISSN:	1384-5810; 1573-756X
DOI:	10.1007/s10618-021-00765-5
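
Note on the summary: the "decomposition of dilated convolutions" and the gridding artifacts it mentions can be illustrated with a small sketch. The code below is not taken from the article and does not implement the authors' degridding methods or the SS output layer. It is a minimal 1-D, pure-Python illustration, with hypothetical function names, of the commonly used view that a dilated convolution with rate r equals periodic subsampling into r groups, a standard convolution on each group, and re-interleaving. Because the groups never exchange information, neighboring outputs are computed from disjoint inputs, which is one way to understand gridding.

```python
def dilated_conv1d(x, w, rate):
    """'Valid' 1-D dilated convolution (cross-correlation form, no padding)."""
    span = (len(w) - 1) * rate  # distance between first and last tap
    return [
        sum(w[k] * x[i + k * rate] for k in range(len(w)))
        for i in range(len(x) - span)
    ]


def decomposed_dilated_conv1d(x, w, rate):
    """Same result, computed group by group.

    1. Periodic subsampling: split x into `rate` groups by index mod rate.
    2. Standard (rate-1) convolution on each group independently.
    3. Re-interleave the group outputs into one signal.
    """
    span = (len(w) - 1) * rate
    out = [0.0] * (len(x) - span)
    for g in range(rate):
        group = x[g::rate]                    # step 1
        conv = dilated_conv1d(group, w, 1)    # step 2
        for j, value in enumerate(conv):      # step 3
            out[g + j * rate] = value
    return out


# An impulse at an odd position only ever reaches odd output positions
# when rate = 2: the even and odd groups never interact.
impulse = [0, 0, 0, 1, 0, 0, 0, 0, 0]
impulse_out = dilated_conv1d(impulse, [1, 1, 1], rate=2)
print(impulse_out)  # [0, 1, 0, 1, 0]
```

The alternating zeros in the impulse response are the 1-D analogue of the grid pattern; in 2-D, a rate-r dilated convolution splits the feature map into r × r such independent groups.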