Aerial Scene Classification by Surf Feature Extarction Using Unsupervised Learning


  • S. G.Hymlin Rose
  • T. D.Subha



aerial, SVM, unsupervised, etc.


The high-resolution satellite imagery which consisted of rich data allow us to directly model aerial scenes by understanding their spatial and structural patterns. Efficient representation and recognition of scenes from image data are challenging. For satellite image analysis, pixel and object based classification approaches are widely used but these approaches often exploit the high-fidelity image data only in a limited way. In this paper, we explore a supervised feature learning approach for aerial scene classification. This system follows some peculiar steps like Noise Removal, Feature Extraction (SURF), Feature Learning and Classification. SURF features are extracted from the input image and classification is done by Latent Dirichlet Allocation Algorithm. This technique can be applied to several challenging aerial scene data sets: ORNL-I data set consisting of 1-m spatial resolution satellite imagery, UCMERCED data set with sub-meter resolution, and ORNL-II data set for large-facility scene detection. The proposed method is highly effective in developing a detection system that can be used to automatically scan large-scale high-resolution satellite imagery for detecting large facilities such as a shopping mall.




[1] Y. Yang and S. Newsam, “Bag-of-visual-words and spatial extensions for land use classification,†in Proc. ACM Int. Conf. Adv.Geogr.Inf. Syst., 2010, pp. 270– 279.

[2] M. Pesaresi and A. Gerhardinger, “Improved textural built-up presence index for automatic recognition of human settlements in arid regions with scattered vegetation,†IEEE J. Sel. Topics Appl. Earth Observ. Remote Sens., vol. 4, no. 1, pp. 16–26, Mar.2011.

[3] I. A. Rizvi and B. K. Mohan, “Object-based image analysis of high resolution satellite images using modified cloud basis function neural network and probabilistic relaxation labeling process, â€IEEE Trans. Geosci. Remote Sens., vol. 49, no.12, pp. 4815–4820, Dec 2011.

[4] R. Bellens, S. Gautama, L. Martinez-Fonte, W. Philips, J. C.-W. Chan and F. Canters, “Improved classification of VHR images of urban areas using directional morphological profiles,†IEEE Trans. Geosci. Remote Sens., vol. 46, no. 10, pp. 2803–2813, Oct. 2008.

[5] A. K. Shackelford and C. H. Davis, “A combined fuzzy pixel-based and object- based approach for classification of high-resolution multispectral data over urban areas,†IEEE Trans. Geosci. Remote Sens., vol. 41, no. 10, pp. 2354–2363, Oct. 2003.

[6] P. Gamba, F. Dell’Acqua, G. Lisini and G. Trianni, “ Improved VHR urban area mapping exploiting object boundaries,†IEEE Trans. Geosci.Remote Sens.,vol. 45, no. 8, pp. 2676–2682, Aug. 2007.

[7] J. Sivic and A. Zisserman, “Video google: A text retrieval approach to object matching in videos,†in Proc. IEEE Int. Conf. Comput. Vis., 2003, pp. 1470–1477.

[8] Y.-L. Boureau, F. Bach, Y. LeCun and J. Ponce, “Learning mid-level features for recognition,†in Proc. IEEE Conf. Comput. Vis. Pattern Recognit., 2010, pp. 2559–2566.

[9] S. Lazebnik, C. Schmid and J. Ponce, “Beyond bags of features: Spatial pyramid matching f or recognizing natural scene categories,†in Proc. IEEE Conf. Comput. Vis. Pattern Recognit., 2006, vol. 2, pp. 2169–2178.

[10] D. G Lowe, “Object recognition from local scale-invariant features,†in Proc. IEEE Int. Conf. Comput.Vis.,Kerkyra,Greece, 1999, pp. 1150–1157.

View Full Article: