期刊论文详细信息
JOURNAL OF MULTIVARIATE ANALYSIS 卷:146
On the asymptotics of random forests
Article
Scornet, Erwan1 
[1] Univ Paris 06, Sorbonne Univ, F-75005 Paris, France
关键词: Random forests;    Randomization;    Consistency;    Central limit theorem;    Empirical process;    Number of trees;    q-quantile;   
DOI  :  10.1016/j.jmva.2015.06.009
来源: Elsevier
PDF
【 摘 要 】

The last decade has witnessed a growing interest in random forest models which are recognized to exhibit good practical performance, especially in high-dimensional settings. On the theoretical side, however, their predictive power remains largely unexplained, thereby creating a gap between theory and practice. In this paper, we present some asymptotic results on random forests in a regression framework. Firstly, we provide theoretical guarantees to link finite forests used in practice (with a finite number M of trees) to their asymptotic counterparts (with M = infinity). Using empirical process theory, we prove a uniform central limit theorem for a large class of random forest estimates, which holds in particular for Breiman's (2001) original forests. Secondly, we show that infinite forest consistency implies finite forest consistency and thus, we state the consistency of several infinite forests. In particular, we prove that q quantile forests - close in spirit to Breiman's (2001) forests but easier to study - are able to combine inconsistent trees to obtain a final consistent prediction, thus highlighting the benefits of random forests compared to single trees. (C) 2015 Elsevier Inc. All rights reserved.

【 授权许可】

Free   

【 预 览 】
附件列表
Files Size Format View
10_1016_j_jmva_2015_06_009.pdf 374KB PDF download
  文献评价指标  
  下载次数:1次 浏览次数:0次