Integrating Multilayer Features of Convolutional Neural Networks for Remote Sensing Scene Classification

Erzhu Li; Junshi Xia; Peijun Du; Cong Lin; Alim Samat

doi:10.1109/tgrs.2017.2711275

Integrating Multilayer Features of Convolutional Neural Networks for Remote Sensing Scene Classification

Erzhu Li(Jiangsu Center for Collaborative Innovation in Geographical Information Resource Development and Application), Junshi Xia(The University of Tokyo), Peijun Du(Jiangsu Center for Collaborative Innovation in Geographical Information Resource Development and Application), Cong Lin(Jiangsu Center for Collaborative Innovation in Geographical Information Resource Development and Application), Alim Samat(Chinese Academy of Sciences)

IEEE Transactions on Geoscience and Remote Sensing

June 27, 2017

10.1109/tgrs.2017.2711275

Cited by 297

Abstract

Scene classification from remote sensing images provides new possibilities for potential application of high spatial resolution imagery. How to efficiently implement scene recognition from high spatial resolution imagery remains a significant challenge in the remote sensing domain. Recently, convolutional neural networks (CNN) have attracted tremendous attention because of their excellent performance in different fields. However, most works focus on fully training a new deep CNN model for the target problems without considering the limited data and time-consuming issues. To alleviate the aforementioned drawbacks, some works have attempted to use the pretrained CNN models as feature extractors to build a feature representation of scene images for classification and achieved successful applications including remote sensing scene classification. However, existing works pay little attention to exploring the benefits of multilayer features for improving the scene classification in different aspects. As a matter of fact, the information hidden in different layers has great potential for improving feature discrimination capacity. Therefore, this paper presents a fusion strategy for integrating multilayer features of a pretrained CNN model for scene classification. Specifically, the pretrained CNN model is used as a feature extractor to extract deep features of different convolutional and fully connected layers; then, a multiscale improved Fisher kernel coding method is proposed to build a mid-level feature representation of convolutional deep features. Finally, the mid-level features extracted from convolutional layers and the features of fully connected layers are fused by a principal component analysis/spectral regression kernel discriminant analysis method for classification. For validation and comparison purposes, the proposed approach is evaluated via experiments with two challenging high-resolution remote sensing data sets, and shows the competitive performance compared with fully trained CNN models, fine-tuning CNN models, and other related works.

Related Papers

No related papers found

Powered by citation graph analysis