| Revista Colombiana de Estadística | |
| Three Similarity Measures between One-Dimensional DataSets | |
| FRANCISCO VELASCO MORENTE1  JOSE M. GAVILAN1  LUIS GONZALEZ-ABRIL1  | |
| [1] Universidad de Sevilla; | |
| 关键词: distancia entre intervalos; métodos del núcleo; minería de datos; tests no paramétricos; | |
| DOI : 10.15446/rce.v37n1.44359 | |
| 来源: DOAJ | |
【 摘 要 】
Based on an interval distance, three functions are given in order to quantify similarities between one-dimensional data sets by using first-order statistics. The Glass Identification Database is used to illustrate how to analyse a data set prior to its classification and/or to exclude dimensions. Furthermore, a non-parametric hypothesis test is designed to show how these similarity measures, based on random samples from two populations, can be used to decide whether these populations are identical. Two comparative analyses are also carried out with a parametric test and a non-parametric test. This new non-parametric test performs reasonably well in comparison with classic tests.
【 授权许可】
Unknown