Squeeze-and-Attention Networks for Semantic Segmentation

Zilong Zhong(Sun Yat-sen University), Zhong Qiu Lin(University of Waterloo), Rene Bidart(University of Waterloo), Xiaodan Hu(University of Waterloo), Ibrahim Ben Daya(University of Waterloo), Zhifeng Li, Wei‐Shi Zheng(Ministry of Education of the People's Republic of China), Jonathan Li(University of Waterloo), Alexander Wong(University of Waterloo)
Unknown
June 1, 2020
Cited by 327

Abstract

The recent integration of attention mechanisms into segmentation networks improves their representational capabilities through a great emphasis on more informative features. However, these attention mechanisms ignore an implicit sub-task of semantic segmentation and are constrained by the grid structure of convolution kernels. In this paper, we propose a novel squeeze-and-attention network (SANet) architecture that leverages an effective squeeze-and-attention (SA) module to account for two distinctive characteristics of segmentation: i) pixel-group attention, and ii) pixel-wise prediction. Specifically, the proposed SA modules impose pixel-group attention on conventional convolution by introducing an 'attention' convolutional channel, thus taking into account spatial-channel inter-dependencies in an efficient manner. The final segmentation results are produced by merging outputs from four hierarchical stages of a SANet to integrate multi-scale contexts for obtaining an enhanced pixel-wise prediction. Empirical experiments on two challenging public datasets validate the effectiveness of the proposed SANets, which achieves 83.2 % mIoU (without COCO pre-training) on PASCAL VOC and a state-of-the-art mIoU of 54.4 % on PASCAL Context.


Related Papers

No related papers found

Powered by citation graph analysis