On the Shift Invariance of Max Pooling Feature Maps in Convolutional Neural Networks (2209.11740v2)

Published 19 Sep 2022 in cs.CV, cs.AI, eess.SP, and stat.ML

Abstract: This paper focuses on improving the mathematical interpretability of convolutional neural networks (CNNs) in the context of image classification. Specifically, we tackle the instability issue arising in their first layer, which tends to learn parameters that closely resemble oriented band-pass filters when trained on datasets like ImageNet. Subsampled convolutions with such Gabor-like filters are prone to aliasing, causing sensitivity to small input shifts. In this context, we establish conditions under which the max pooling operator approximates a complex modulus, which is nearly shift invariant. We then derive a measure of shift invariance for subsampled convolutions followed by max pooling. In particular, we highlight the crucial role played by the filter's frequency and orientation in achieving stability. We experimentally validate our theory by considering a deterministic feature extractor based on the dual-tree complex wavelet packet transform, a particular case of discrete Gabor-like decomposition.
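
To make the abstract's central claim concrete, here is a minimal NumPy sketch, not the authors' method or code. It applies a single 1D Gabor-like filter to random noise, shifts the input by one sample, and measures how much three feature maps change: the subsampled real part, max pooling of the real part, and the subsampled complex modulus. The filter parameters (`omega`, `sigma`), the stride of 4, the circular convolution, the non-overlapping pooling windows, and all function names are assumptions chosen for illustration. The paper itself works with 2D filters and the dual-tree complex wavelet packet transform.

```python
import numpy as np


def gabor_response(x, omega=2.0, sigma=4.0):
    """Circular convolution of x with a complex Gabor-like filter (illustrative parameters)."""
    # Integer sample offsets 0, 1, ..., N/2-1, -N/2, ..., -1 so the filter is centered at 0.
    n = np.fft.fftfreq(x.size, d=1.0 / x.size)
    g = np.exp(-n**2 / (2 * sigma**2)) * np.exp(1j * omega * n)
    return np.fft.ifft(np.fft.fft(x) * np.fft.fft(g))


def relative_change(a, b):
    """Relative L2 distance between two feature maps."""
    return np.linalg.norm(a - b) / np.linalg.norm(a)


def shift_sensitivity(stride=4, seed=0, length=256):
    """Compare feature maps of a signal and of its one-sample circular shift."""
    rng = np.random.default_rng(seed)
    x = rng.standard_normal(length)
    y = gabor_response(x)
    y_shifted = gabor_response(np.roll(x, 1))

    features = {
        # Subsampled real-valued convolution: prone to aliasing.
        "real": lambda z: z.real[::stride],
        # Max pooling over non-overlapping windows of the real part (assumed layout).
        "maxpool": lambda z: z.real.reshape(-1, stride).max(axis=1),
        # Subsampled complex modulus: the nearly shift-invariant reference.
        "modulus": lambda z: np.abs(z)[::stride],
    }
    return {name: relative_change(f(y), f(y_shifted)) for name, f in features.items()}


if __name__ == "__main__":
    for name, value in shift_sensitivity().items():
        print(f"{name:8s} relative change after a 1-sample shift: {value:.3f}")
```

Under these assumptions, the subsampled real part should change far more than the modulus, because a one-sample shift rotates the phase of the band-pass response while its envelope varies slowly. How closely max pooling tracks the modulus depends on the filter frequency relative to the pooling window, which is the dependence the paper analyzes.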

Authors (4)
  1. Hubert Leterme (5 papers)
  2. Kévin Polisano (12 papers)
  3. Valérie Perrier (7 papers)
  4. Karteek Alahari (48 papers)
Citations (2)
