期刊论文详细信息
Frontiers in Water
Uncertainty quantification of machine learning models to improve streamflow prediction under changing climate and environmental conditions
Water
Siyan Liu1  Dan Lu1  Natalie A. Griffiths2  Eric M. Pierce2  Scott L. Painter2 
[1] Computational Sciences and Engineering Division, Oak Ridge National Laboratory, Oak Ridge, TN, United States;Environmental Sciences Division, Oak Ridge National Laboratory, Oak Ridge, TN, United States;
关键词: uncertainty quantification;    machine learning;    Long Short-Term Memory networks;    streamflow prediction;    changing climate and environment conditions;   
DOI  :  10.3389/frwa.2023.1150126
 received in 2023-01-23, accepted in 2023-03-30,  发布年份 2023
来源: Frontiers
PDF
【 摘 要 】

Machine learning (ML) models, and Long Short-Term Memory (LSTM) networks in particular, have demonstrated remarkable performance in streamflow prediction and are increasingly being used by the hydrological research community. However, most of these applications do not include uncertainty quantification (UQ). ML models are data driven and can suffer from large extrapolation errors when applied to changing climate/environmental conditions. UQ is required to quantify the influence of data noises on model predictions and avoid overconfident projections in extrapolation. In this work, we integrate a novel UQ method, called PI3NN, with LSTM networks for streamflow prediction. PI3NN calculates Prediction Intervals by training 3 Neural Networks. It can precisely quantify the predictive uncertainty caused by the data noise and identify out-of-distribution (OOD) data in a non-stationary condition to avoid overconfident predictions. We apply the PI3NN-LSTM method in the snow-dominant East River Watershed in the western US and in the rain-driven Walker Branch Watershed in the southeastern US. Results indicate that for the prediction data which have similar features as the training data, PI3NN precisely quantifies the predictive uncertainty with the desired confidence level; and for the OOD data where the LSTM network fails to make accurate predictions, PI3NN produces a reasonably large uncertainty indicating that the results are not trustworthy and should avoid overconfidence. PI3NN is computationally efficient, robust in performance, and generalizable to various network structures and data with no distributional assumptions. It can be broadly applied in ML-based hydrological simulations for credible prediction.

【 授权许可】

Unknown   
Copyright © 2023 Liu, Lu, Painter, Griffiths and Pierce.

【 预 览 】
附件列表
Files Size Format View
RO202310107837525ZK.pdf 2888KB PDF download
  文献评价指标  
  下载次数:3次 浏览次数:1次