Davari, Amirabbas and Aptoula, Erchan and Yanıkoğlu, Berrin and Maier, Andreas and Riess, Christian (2018) GMM-based synthetic samples for classification of hyperspectral images with limited training data. IEEE Geoscience and Remote Sensing Letters, 15 (6). pp. 942-946. ISSN 1545-598X (Print) 1558-0571 (Online)
Full text not available from this repository. (Request a copy)
Official URL: http://dx.doi.org/10.1109/LGRS.2018.2817361
Abstract
The amount of training data that is required to train a classifier scales with the dimensionality of the feature data. In hyperspectral remote sensing (HSRS), feature data can potentially become very high dimensional. However, the amount of training data is oftentimes limited. Thus, one of the core challenges in HSRS is how to perform multiclass classification using only relatively few training data points. In this letter, we address this issue by enriching the feature matrix with synthetically generated sample points. These synthetic data are sampled from a Gaussian mixture model (GMM) fitted to each class of the limited training data. Although the true distribution of features may not be perfectly modeled by the fitted GMM, we demonstrate that a moderate augmentation by these synthetic samples can effectively replace a part of the missing training samples. Doing so, the median gain in classification performance is 5% on two datasets. This performance gain is stable for variations in the number of added samples, which makes it easy to apply this method to real-world applications.
Item Type: | Article |
---|---|
Uncontrolled Keywords: | Hyperspectral remote sensing (HSRS) image classification; limited training data; synthetic data |
Divisions: | Faculty of Engineering and Natural Sciences > Academic programs > Computer Science & Eng. Faculty of Engineering and Natural Sciences |
Depositing User: | Berrin Yanıkoğlu |
Date Deposited: | 06 Sep 2018 15:47 |
Last Modified: | 15 Jun 2023 14:53 |
URI: | https://research.sabanciuniv.edu/id/eprint/36552 |