Remote Sensing | 卷:9 |
Effect of Training Class Label Noise on Classification Performances for Land Cover Mapping with Satellite Image Time Series | |
Claire Marais Sicre1  Gérard Dedieu1  Charlotte Pelletier1  Silvia Valero1  Jordi Inglada1  Nicolas Champion2  | |
[1] CESBIO—UMR 5126/Université de Toulouse, CNES/CNRS/IRD/UPS, 18 avenue Edouard Belin, 31401 Toulouse CEDEX 9, France; | |
[2] IGN Espace—Université Paris-Est Marne-la-Vallée, LASTIG/MATIS, 73 avenue de Paris, 94160 Saint Mandé, France; | |
关键词: class label noise; mislabeled training data; satellite image time series; classification; land cover mapping; Support Vector Machines; Random Forests; | |
DOI : 10.3390/rs9020173 | |
来源: DOAJ |
【 摘 要 】
Supervised classification systems used for land cover mapping require accurate reference databases. These reference data come generally from different sources such as field measurements, thematic maps, or aerial photographs. Due to misregistration, update delay, or land cover complexity, they may contain class label noise, i.e., a wrong label assignment. This study aims at evaluating the impact of mislabeled training data on classification performances for land cover mapping. Particularly, it addresses the random and systematic label noise problem for the classification of high resolution satellite image time series. Experiments are carried out on synthetic and real datasets with two traditional classifiers: Support Vector Machines (SVM) and Random Forests (RF). A synthetic dataset has been designed for this study, simulating vegetation profiles over one year. The real dataset is composed of Landsat-8 and SPOT-4 images acquired during one year in the south of France. The results show that both classifiers are little influenced for low random noise levels up to 25%–30%, but their performances drop down for higher noise levels. Different classification configurations are tested by increasing the number of classes, using different input feature vectors, and changing the number of training instances. Algorithm complexities are also analyzed. The RF classifier achieves high robustness to random and systematic label noise for all the tested configurations; whereas the SVM classifier is more sensitive to the kernel choice and to the input feature vectors. Finally, this work reveals that the cross-validation procedure is impacted by the presence of class label noise.
【 授权许可】
Unknown