Polarimetric synthetic aperture radar (PolSAR) has emerged through the evolution of airborne and satellite remote sensing.220.127.116.11.–6 PolSAR application requirements have become increasingly important because the mismatch between the vast amount of remote sensing data and inadequate information processing capabilities has become significant. PolSAR data contain information on various scattering mechanisms of different terrain structures and materials.18.104.22.168.12.13.–14 Currently, PolSAR is used as a type of radar that can provide terrain classification information.
In recent years, PolSAR research in remote sensing has become widespread because a greater amount of information on scattering objects is found in PolSAR data than in single polarization and double polarization data.15,16 PolSAR data depict the state changes of microwave polarization, which is produced by the dielectric constant and terrain structure. Therefore, terrain features can be obtained from the PolSAR data.1 Moreover, terrain classification of PolSAR imagery is considered a typical PolSAR application, which employs features and classifiers to separate different terrain types.
Currently, PolSAR image classification has been successfully applied to many practical problems. Related information processing techniques are being continuously developed, such as image filtering, feature extraction, and classification algorithms. PolSAR can measure each observation terrain using a full scattering matrix. Its feature distribution complexity often results in scattered signals of different terrains with similar features, which increases the difficulty of discriminating information extraction. Many algorithms have been proposed4,5 for polarimetric data analyses, such as eigenvalue analysis, polarization decomposition, and so on.
Feature extraction and the classifier structure are two important factors that affect the final classification result. Pixel-based PolSAR terrain classification specifies the category of each pixel in the feature space. These features indicate the pixel characteristics and can be Pauli features, grayscales, textures, and so on. They must be extracted and used as input for the terrain discrimination. Different objects have different surface features. Therefore, the PolSAR image textural features have location discontinuities in the properties of adjacent image regions. Thus, they can be considered features for classification. Because the texture features contain rich identification information, they are important for classification.17
Several approaches exist for extracting the texture features, such as the gray-level co-occurrence matrix (GLCM), wavelets, Gabor filters, and local binary patterns (LBPs). Furthermore, textural features have been utilized for object detection and terrain classification.18,19 An efficient use of the Gabor filter was demonstrated by Leone and Distante20 for terrain detection in a textured scene. The LBPs are utilized to evaluate the similarity of neighboring image regions.21 Nonetheless, GLCM remains a more popular method.22 More effective terrain classification performance can be achieved by combining different textures, and the resulting discrimination performance is significantly improved.
Many classification methods exist, such as the classical maximum likelihood,23 neural networks,24 and the decision tree methods.25 In synthetic aperture radar (SAR) image classification and segmentation, the statistical distribution model has been commonly used. Among these models, the Markov random field (MRF)26 is an efficient approach for image classification. However, in addressing nonstationary SAR images, MRF has difficulty obtaining improved results. Meanwhile, the support vector machine (SVM)2728.–29 is a supervised method based on the pixel level. As a discriminative model, SVM directly employs training sample modeling based on the structural risk minimization criterion. It can thereby handle the linear inseparability problem by introducing the kernel function. Moreover, it does not have local minima because it solves the convex quadratic programming problem. Nonetheless, it does not consider spatial information.
The classifier of the naive Bayes method is a probabilistic classifier based on the independence assumption of Bayes’ theorem. Meanwhile, the -nearest neighbor (KNN) algorithm, proposed by Cover and Hart,30 has high precision and is not sensitive to outliers. In addition, discriminant analysis is a renowned statistical recognition method that has been successfully applied to category prediction.31 Recently, neural networks have been introduced for image classification on account of their ability to approximate nonlinear functions in the input space.32,33 Moreover, the extreme learning machine (ELM)34 is a feedforward neural network with a single hidden layer. It has strong generalization performance and a very fast classification speed.
With consideration of the above approaches, we herein present and discuss terrain classification of PolSAR imagery based on ensemble learning with Pauli and Gabor features. The main contributions of this paper are the following. Polarimetric and texture features are combined for PolSAR image classification. We utilize an efficient method for PolSAR image feature representation. Our method adopts an ensemble learning strategy and selects five base classifiers to form an ensemble classifier. These classifiers are not only more accurate but also distinct from each other. The weight of each base classifier is obtained by a differential evolution (DE) algorithm.
The remainder of this paper is organized as follows. Pixel feature representation is introduced in Sec. 2. In Sec. 3, the classification system construction is presented. We provide and analyze the simulation results of the experiment in Sec. 4. In Sec. 5, our conclusions and future work are discussed.
Pixel Feature Representation
Image segmentation requires the assignment of categories to all pixels in an image. Therefore, accurate pixel feature extraction is very important for image segmentation.
The image texture is determined based on the differences in spatial distributions in the image, specifically the frequency and intensity of each pixel. In this study, we examine the use of enhanced discriminatory information to improve classification accuracy. We therefore introduce texture features for classification. The Gabor transform has been widely applied in image processing. The texture features are derived by convolving the image and Gabor filter banks. A set of two-dimensional (2-D) Gabor filters is employed to convolve the image. A 2-D Gabor function is formed by a sinusoidal plane wave with a certain frequency and direction. It is modulated by a 2-D Gaussian function.35 In the spatial domain, the Gabor filter is formulated as
The ratio specifies the bandwidth of the spatial frequency of simple cells. The bandwidth, , (in octaves) of the half-response spatial frequency and ratio are given as follows:36
The Gaussian smoothing function is used to perform feature extraction. It is defined as follows:
Note that we apply a bicubic interpolation method to restore the image size. Finally, we obtain the 96-dimensional Gabor feature.
Principal component analysis (PCA) is an important method of identifying data patterns. PCA can reduce the dimensions without losing excessive information. It is thus often applied to image compression. We therefore employed PCA to reduce the Gabor feature dimensions to five. Figures 1(a)–1(f) show the original grayscale imagery, and the first, third, second, fourth, and fifth principal component imagery, respectively, which were obtained by the PCA reduction of the Gabor feature dimensions.
The classification results of polarimetric feature extraction by the Pauli decomposition method are more stable and effective in polarimetric decomposition methods. Therefore, Pauli features were utilized to extract information on the terrain scattering mechanism. Analysis of PolSAR image data is generally based on a matrix. The complex scattering matrix corresponding to the scattering terrain can be measured by PolSAR with different polarizations.7 Four combinations exist for reception and transmission polarizations, namely, horizontal–horizontal (HH), horizontal–vertical (HV), vertical–vertical (VV), and vertical–horizontal (VH). Scattering matrix characterizes the relations between the polarization states of the microwaves scattered and those received by the terrains. The general form of the scattering process can be given by
In Eq. (7), the parameters , , and , are, respectively, the amplitudes and the phases. PolSAR can obtain the information of the scattering terrains from the five parameters.37 The formula eigenvalue analysis forms are generally provided based on coherency matrix . The coherency matrix is represented by the Pauli feature vector as follows:
The Pauli feature is used in this study. The Pauli feature vector is given byFigure 2 shows the results of the Pauli decomposition of the PolSAR image data from San Francisco Bay, California.
With consideration of the correlations between these descriptors and their ability to represent actual terrain imagery, we chose a reasonable combination of features through experiments for terrain classification.
Classification System Construction
The primary goal of constructing the ensemble classifier is to increase the classifier capability by combining the superior performance of the various base learners. In general, to obtain effective integration, the base classifiers should be as precise and distinct as possible. This requirement was shown by Krogh and Vedelsby.38 In this study, to construct an effective classifier, we explore and construct an ensemble-weighted voting classifier, including two SVMs, which are formed by the two kernel functions of radial basis function (RBF) and sigmoid, as well as ELM, KNN, and the discriminant analysis classifier. These classifiers have lower computational complexity and different theoretical backgrounds and can avoid redundancies and biases.
SVM is a supervised learning model with an associated learning method that analyzes the classification data. The SVM algorithm first establishes a model by training samples. Using this model, the new test samples are classified. It is a nonprobabilistic classifier. Intuitively, a good classification result of the SVM classifier is realized by the optimal separating hyperplane. In general, the larger the margin is, the lower the classification error rate is. SVM can efficiently find nonlinear solutions using the “kernel trick.” Then different types of SVM classifiers are formed by various kernel functions, including RBF, sigmoid, linear, and polynomial functions.
An effective learning method, ELM, is a feedforward neural network with one hidden layer. In this method, the weights of the hidden layer nodes are stochastically selected. The weights of the output layer are slated by the least-squares method. ELM can be derived from randomly generated hidden neurons and independent training data. It offers significant advantages over conventional neural network learning algorithms, including ease of implementation, a fast learning speed, and minimal need for human intervention. The pseudocode for ELM is given in Table 1. A more detailed introduction to ELM can be found in Ref. 34.
Algorithmic steps for ELM.
|Input: Number of hidden layer nodes, training dataset , activation function .|
|Output: The optimal output weight|
|I. For to do|
|II. Randomly assign the weight vector connecting the ’th hidden node and the input nodes .|
|III. Randomly assign the threshold of the ’th hidden node .|
|IV. End for|
|V. Calculate the hidden layer output matrix .|
|VI. , where is the Moore–Penrose generalized inverse (pseudoinverse) of the hidden layer output matrix , and .|
Meanwhile, KNN is an instance-based learning algorithm. The output is a member of the class in the KNN classification. A sample is classified according to the majority of votes of its adjacent samples, with the sample being specified in the category of its KNN samples. In addition, discriminant analysis is a method for learning identification features. It has been applied to predict the category to which a subject belongs. There are two basic steps in discriminant analysis. The first step is to estimate the weight factors, which can be used to characterize the attributes of the known samples and compute the measurements of their trends. In the second step, information is applied to create the decision-making rule that ensures some threshold for forecasting.
We designed the presented classification model based on the requirements of PolSAR terrain classification. We identified the favorable weights of all base classifiers using the approaches presented in Ref. 39. The main steps are given in Table 2.
Algorithmic steps for DE optimization.
|Input: Parameters in DE: crossover rate (CR), variation coefficients , and population scale .|
|Output: Optimal weights () for weighted voting ensemble learning method based on differential evolution (DEWVote).|
|I. Initialization: Create the random species with uniform distribution of individuals, . Here, is a vector that expresses the weight coefficients () to the base classifiers.|
|II. Set the generation iterator .|
|While the stopping criterion is not satisfied|
|For (; ;)|
|III. Randomly select the indices , , and , which are different from each other and from index .|
|IV. Calculate a mutated vector using I.|
|V. Obtain random number .|
|For (; ; )|
|VI. Decide trial individual using IV|
|VII. Calculate the fitness of and using a ten-fold crossover operation, and update of the next generation () using V.|
|VIII. Update generation iterator .|
The scheme of the classification method is illustrated in Fig. 3. The implementing procedure of this methodology is described as follows.
Step 1: Extract textural features based on the Gabor transform and obtain Pauli features from the PolSAR image data. The most suitable texture feature is selected through experiments. The combination of the selected texture features and Pauli features is used for the classification systems.
Step 2: Train ELM, KNN, discriminant analysis, and the two SVM classifiers using training samples.
Step 3: Optimize the weights of each individual classifier to obtain an ensemble classifier. Search the optimal weights by DE. The algorithmic steps of optimizing the weights are given in Table 2. Details are provided in Ref. 39.
Step 4: Provide the prediction category label, , of the weighted voting for each sample, . The operation is as follows:
Experiments and Discussion
To verify the performance of the proposed PolSAR image classification scheme, we adopted airborne PolSAR data of San Francisco Bay. The PolSAR data were presented with no header in the STK-MLC format with . They provided coverage of three classes: urban areas, ocean, and vegetation [Fig. 4(a)].
First, we selected five groups of from each category and composed the initial training sample set. Each class had 2000 training samples. The building unit block size is shown in Fig. 4(b). We determined the testing accuracy rate to more accurately compare the various methods. Owing to the lack of a real and reliable terrain map, we selected 112,500 pixels that could be specified as the labels of test samples. The image-marked test samples are shown in Fig. 4(c). In the DE algorithm, the choice of DE parameters can significantly influence the optimal performance. For simplicity, we set factor , , population , and maximum iteration number . All experiments were implemented by MATLAB software on a Windows 10 64-bit system with an Intel(R) Core(TM) i7-4790 CPU at 3.60 GHz and 16 GB of RAM.
To achieve feature selection, we compared the effectiveness in different composite approaches with polarimetric and Gabor features using the classifier based on discriminant analysis. This classifier is a highly efficient algorithm for building image classifiers. We used the “linear” discriminant type in the experiment. We randomly selected 100 samples from the initial training sample set as the training sample set.
The composition schemes were: (a) Pauli features, (b) Pauli features and the first principal component feature of the Gabor feature reduced in dimension by PCA, (c) Pauli features and the third principal component feature of the Gabor feature reduced in dimension by PCA, (d) Pauli features and the second principal component feature of the Gabor feature reduced in dimension by PCA, (e) Pauli features and the fourth principal component feature of the Gabor feature reduced in dimension by PCA, and (f) Pauli features and the fifth principal component feature of the Gabor feature reduced in dimension by PCA. The classification accuracies using different features were evaluated, as shown in Fig. 5 and Table 3, where textural features change the classification accuracies on the three terrain areas.
Classification accuracies of the discriminant analysis classifier in different compositions of features when the Gabor feature is reduced to different dimensions. [Accuracy (%); “linear” specifies the discriminant type.]
|Dimension down to 1||96.5631||97.5867|
|Dimension down to 2||96.5089||97.5867|
|Dimension down to 3||96.8218||96.5089||97.5867|
|Dimension down to 4||96.7684||96.8218||96.5089||97.5867|
|Dimension down to 5||96.5507||96.7684||96.8218||96.5089||97.5867|
The single principal component feature of the Gabor feature reduced in dimension by PCA is important in distinguishing the different terrain pixels. To elucidate this importance, we considered different composition schemes of Pauli features and five principal component features of the Gabor feature reduced in dimension by PCA for each pixel. Figure 5 shows the terrain classification results using different composite approaches with Pauli features and Gabor features when the dimensionality of the Gabor feature is reduced to 5. The classification effects with different combinations of features when the Gabor feature is reduced in dimension are shown in Table 3.
Based on the data in Table 3, it is possible to assess the distinction ability of each composition feature. The table shows that the composition of Pauli features and the final principal component feature of the Gabor feature reduced in dimension by PCA has the best classification results (accuracy: 97.5867%). In addition, a regular pattern shows that the classification accuracies are all the same for the composite approach with Pauli features and the final principal component feature of the Gabor feature reduced in dimension.
We compared the performances of eight classifiers, including the five basic classifiers, namely, ELM, KNN, discriminant analysis, and the two SVM classifiers, and the three ensemble classifiers, specifically AdaBoostM2, random forest, and the proposed method. In view of the previous experimental results, we adopted the combination of Pauli features and the fifth principal component feature of the Gabor feature reduced to five dimensions by PCA. Each pixel was represented by the four-dimensional feature vector, which was comprised of the three-dimensional Pauli feature vector and one-dimensional Gabor feature.
We randomly selected 300 samples from the initial training sample set as the training sample set. According to the experimental results (Table 4), , and the discriminant analysis classifier is designated as a linear discriminant type. The AdaBoostM2 algorithm randomly selected an ensemble of 400 trees and used the default tree options. Random forest chose 500 as (the number of trees). We applied fivefold cross-validation on the training set to adjust the SVM parameters.
Classification accuracies of the eight classifiers [accuracy (%)].
|Classifier||KNN classifier||Discriminant analysis classifier||Sigmoid SVM classifier||RBF SVM classifier||ELM classifier||AdaBoostM2 algorithm||Random forest||Proposed classification method|
|Accuracy (one time)||96.3929 ()||97.8560 ()||95.1627||96.0782||98.1947||93.5071 ()||95.3493||98.3498|
|Accuracy (two times)||96.7280 ()||96.8498 ()||97.8222||93.8849 ()||95.3467||98.1582|
|Accuracy (three times)||96.3858 ()||91.8320 ()||97.9458||93.9213 ()||95.3156||98.2365|
|Accuracy (four times)||96.6062 ()||90.4871 ()||97.9147||93.9773 ()||95.1796||98.2137|
|Accuracy (five times)||96.2996 ()||96.8498 ()||97.8187||93.9773 ()||95.2204||98.1942|
|Maximum accuracy (%), : 96.7280||Maximum accuracy (%), : 97.8560||Accuracy (%): 95.1627||Accuracy (%): 96.0782||Average accuracy (%): 97.9392||Maximum accuracy (%), , 500: 93.9773||Average accuracy (%): 95.2823||Average accuracy (%): 98.2305|
The classification maps of the eight classifying methods are shown in Fig. 6. To obtain an improved predictive accuracy measure, we compared these methods using the average of the five estimates for the ELM classifier, random forest classifier, and the proposed classification method. We employed the best estimate value for the KNN classifier, discriminant analysis classifier, and AdaBoostM2 algorithm when ; the discriminant analysis classifier was linear, and the AdaBoostM2 algorithm randomly selected an ensemble of 400 trees. Table 4 shows the precision values of the eight classifiers.
Note: (1) denotes the number of nearest neighbors. (2) specifies the discriminant type: , “linear;” , “quadratic;” , “diagQuadratic;” , “diagLinear;” , “pseudoQuadratic.” (3) denotes the number of trees arbitrarily chosen for AdaBoostM2. (4) Accuracy ( times) denotes the accuracy of the first experiments (, 2, 3, 4, 5).
Figure 6 and Table 4 show that the eight classifiers can better distinguish the uniform regions corresponding to the primary scattering class, such as the urban areas, vegetation, and ocean. The common limitation of these classifiers is that they cannot effectively distinguish vegetation regions from urban areas. In Table 4, the overall precision of the eight classifiers is compared using the training and testing areas (Fig. 4). For the individual classifier, the classification accuracies of the ELM and discriminant analysis classifiers are higher. Generally speaking, the proposed ensemble method outperforms the AdaBoostM2 algorithm with a high accuracy, and it has the best prediction accuracy (98.2305%) of all eight methods. Hence, by combining general machine learning and artificial intelligence classification methods, the terrain classification accuracy of the full PolSAR image can be improved to a certain extent.
In this paper, an evolutionary classification system for terrain classification of PolSAR imagery based on ensemble learning with polarimetric and texture features was proposed. For supervised terrain classification of PolSAR imagery, our classification approach is based on a weighted voting ensemble using the composite of Pauli features and the fifth principal component feature of the Gabor feature reduced in dimension by PCA. The terrain classification accuracy of PolSAR imagery heavily relies on the image features. Our approach thus leverages the texture features in addition to the original polarimetric features.
The Gabor transform method is a standard technique for extracting texture features from remote sensing images. To identify the discrepancies between neighboring pixels, we analyzed different Gabor transform features that are reduced to different dimensions by PCA in different composite approaches with polarimetric features. The most effective combination of features was selected for PolSAR imagery terrain classification.
The proposed method can effectively classify terrain in PolSAR imagery. This is because the five base classifiers for combining the ensemble classifier are distinct from each other and have strong complementarity. Moreover, we can obtain each base classifier weight by a DE algorithm to automatically stop the computation. This capability provides considerable practicability for PolSAR image processing.
Experimental results confirmed that our approach consistently outperformed the existing approaches. Nevertheless, our method is limited in that the terrain classification accuracy may decrease for very complex scenes. The basic reason for this limitation is that the classification method employs a simple selection method of training samples that may be insufficient for providing a reasonably accurate classification for some multiple scenes. Consequently, in future work, we intend to explore a more appropriate algorithm for selecting training samples that can more precisely depict multiple scenes of PolSAR imagery. In addition, we will examine the combination of other known advanced information types to obtain a more accurate classification result.
This work is partially supported by the National Science Foundation of China under Grant No. 61373063 and the Doctoral Foundation of Southwest University of Science and Technology under Grant No. 11zx711901. The author declares that there are no conflicts of interest with the publication of this paper.
Chuanbo Huang is an associate professor of computer engineering with Southwest University of Science and Technology, Mianyang, China. He received his PhD in control science and engineering from Nanjing University of Science and Technology, Nanjing, China, in 2011. His current research interests include image processing, image modeling, segmentation, and tissue characterization, with a particular interest in applications to remote sensing.