arxiv:1810.12348

Gather-Excite: Exploiting Feature Context in Convolutional Neural Networks

Published on Oct 29, 2018

Abstract

While the use of bottom-up local operators in convolutional neural networks (CNNs) matches well some of the statistics of natural images, it may also prevent such models from capturing contextual long-range feature interactions. In this work, we propose a simple, lightweight approach for better context exploitation in CNNs. We do so by introducing a pair of operators: gather, which efficiently aggregates feature responses from a large spatial extent, and excite, which redistributes the pooled information to local features. The operators are cheap, both in terms of number of added parameters and computational complexity, and can be integrated directly in existing architectures to improve their performance. Experiments on several datasets show that gather-excite can bring benefits comparable to increasing the depth of a CNN at a fraction of the cost. For example, we find ResNet-50 with gather-excite operators is able to outperform its 101-layer counterpart on ImageNet with no additional learnable parameters. We also propose a parametric gather-excite operator pair which yields further performance gains, relate it to the recently-introduced Squeeze-and-Excitation Networks, and analyse the effects of these changes to the CNN feature activation statistics.
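
The gather and excite operators described above can be illustrated with a minimal sketch. The abstract does not specify the operators' exact form, so the choices here are assumptions made for illustration: gather is taken to be average pooling over the full spatial extent of each channel, and excite is taken to be a sigmoid gate that rescales every local feature in that channel. The function names `gather`, `excite`, and `gather_excite` are hypothetical, and the feature map is a plain nested list of shape channels × height × width.

```python
import math


def gather(features):
    """Aggregate each channel over its full spatial extent.

    Assumption: global average pooling stands in for the gather operator.
    """
    pooled = []
    for channel in features:
        values = [v for row in channel for v in row]
        pooled.append(sum(values) / len(values))
    return pooled


def excite(features, pooled):
    """Redistribute pooled context back to every local feature.

    Assumption: each channel is scaled by the sigmoid of its pooled value.
    """
    gates = [1.0 / (1.0 + math.exp(-p)) for p in pooled]
    return [
        [[v * g for v in row] for row in channel]
        for channel, g in zip(features, gates)
    ]


def gather_excite(features):
    """Apply gather, then excite; the output has the input's shape."""
    return excite(features, gather(features))


if __name__ == "__main__":
    # Two channels of a 2x2 feature map (illustrative values only).
    fmap = [
        [[0.0, 0.0], [0.0, 0.0]],
        [[1.0, 3.0], [5.0, 7.0]],
    ]
    print(gather(fmap))
    print(gather_excite(fmap))
```

This variant has no learnable parameters, which is in the spirit of the parameter-free result the abstract reports for ResNet-50; the parametric operator pair mentioned in the abstract is not sketched here.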

