PadChannel: Improving CNN Performance through Explicit Padding Encoding (2311.07623v2)

Published 13 Nov 2023 in cs.CV

Abstract: In convolutional neural networks (CNNs), padding plays a pivotal role in preserving spatial dimensions throughout the layers. Traditional padding techniques do not explicitly distinguish between the actual image content and the padded regions, potentially causing CNNs to incorrectly interpret the boundary pixels or regions that resemble boundaries. This ambiguity can lead to suboptimal feature extraction. To address this, we propose PadChannel, a novel padding method that encodes padding statuses as an additional input channel, enabling CNNs to easily distinguish genuine pixels from padded ones. By incorporating PadChannel into several prominent CNN architectures, we observed small performance improvements and notable reductions in the variances on the ImageNet-1K image classification task at marginal increases in the computational cost. The source code is available at https://github.com/AussieSeaweed/pad-channel

Summary

We haven't generated a summary for this paper yet.

Summarize Now

PadChannel: Improving CNN Performance through Explicit Padding Encoding (2311.07623v2)

Summary

Related Papers