期刊论文详细信息
PeerJ
Why choose Random Forest to predict rare species distribution with few samples in large undersampled areas? Three Asian crane species models provide supporting evidence
article
Chunrong Mi1  Falk Huettmann2  Yumin Guo1  Xuesong Han1  Lijia Wen1 
[1] College of Nature Conservation, Beijing Forestry University;EWHALE Lab, Department of Biology and Wildlife, Institute of Arctic Biology, University of Alaska Fairbanks
关键词: Species distribution models (SDMs);    Random Forest;    Generality (transferability);    Rare species;    Undersampled areas;    White-naped crane (Grus vipio);    Black-necked crane (Grus nigricollis);    Hooded crane (Grus monacha);   
DOI  :  10.7717/peerj.2849
学科分类:社会科学、人文和艺术(综合)
来源: Inra
PDF
【 摘 要 】

Species distribution models (SDMs) have become an essential tool in ecology, biogeography, evolution and, more recently, in conservation biology. How to generalize species distributions in large undersampled areas, especially with few samples, is a fundamental issue of SDMs. In order to explore this issue, we used the best available presence records for the Hooded Crane (Grus monacha, n = 33), White-naped Crane (Grus vipio, n = 40), and Black-necked Crane (Grus nigricollis, n = 75) in China as three case studies, employing four powerful and commonly used machine learning algorithms to map the breeding distributions of the three species: TreeNet (Stochastic Gradient Boosting, Boosted Regression Tree Model), Random Forest, CART (Classification and Regression Tree) and Maxent (Maximum Entropy Models). In addition, we developed an ensemble forecast by averaging predicted probability of the above four models results. Commonly used model performance metrics (Area under ROC (AUC) and true skill statistic (TSS)) were employed to evaluate model accuracy. The latest satellite tracking data and compiled literature data were used as two independent testing datasets to confront model predictions. We found Random Forest demonstrated the best performance for the most assessment method, provided a better model fit to the testing data, and achieved better species range maps for each crane species in undersampled areas. Random Forest has been generally available for more than 20 years and has been known to perform extremely well in ecological predictions. However, while increasingly on the rise, its potential is still widely underused in conservation, (spatial) ecological applications and for inference. Our results show that it informs ecological and biogeographical theories as well as being suitable for conservation applications, specifically when the study area is undersampled. This method helps to save model-selection time and effort, and allows robust and rapid assessments and decisions for efficient conservation.

【 授权许可】

CC BY   

【 预 览 】
附件列表
Files Size Format View
RO202307100014435ZK.pdf 28856KB PDF download
  文献评价指标  
  下载次数:13次 浏览次数:0次