期刊论文详细信息
Mathematical Modelling and Analysis
Accuracy of nonparametric density estimation for univariate Gaussian mixture models: A comparative study
TomasRuzgas1  Jurgita Arnastauskaitė2 
[1] Department of Computer Sciences, Kaunas University of Technology, K. Donelaičio St. 73, 44249 Kaunas, Lithuania;Department of Applied Mathematics, Kaunas University of Technology, K. Donelaičio g. 73, 44249 Kaunas, Lithuania;
关键词: univariate probability density;    nonparametric density estimation;    homogeneity test;    sample clustering;    Monte Carlo method;    municipal solid waste;   
DOI  :  10.3846/mma.2020.10505
来源: DOAJ
【 摘 要 】

Flexible and reliable probability density estimation is fundamental in unsupervised learning and classification. Finite Gaussian mixture models are commonly used for this purpose. However, the parametric form of the distribution is not always known. In this case, non-parametric density estimation methods are used. Usually, these methods become computationally demanding as the number of components increases. In this paper, a comparative study of accuracy of some nonparametric density estimators is made by means of simulation. The following approaches have been considered: an adaptive bandwidth kernel estimator, a projection pursuit estimator, a logspline estimator, and a k-nearest neighbor estimator. It was concluded that data clustering as a pre-processing step improves the estimation of mixture densities. However, in case data does not have clearly defined clusters, the pre-preprocessing step does not give that much of advantage. The application of density estimators is illustrated using municipal solid waste data collected in Kaunas (Lithuania). The data distribution is similar (i.e., with kurtotic unimodal density) to the benchmark distribution introduced by Marron and Wand. Based on the homogeneity tests it can be concluded that distributions of the municipal solid waste fractions in Kutaisi (Georgia), Saint-Petersburg (Russia), and Boryspil (Ukraine) are statistically indifferent compared to the distribution of waste fractions in Kaunas. The distribution of waste data collected in Kaunas (Lithuania) follows the general observations introduced by Marron and Wand (i.e., has one mode and certain kurtosis).

【 授权许可】

Unknown   

  文献评价指标  
  下载次数:0次 浏览次数:0次