Applied Sciences
Machine Learning Models of COVID-19 Cases in the United States: A Study of Initial Lockdown and Reopen Regimes
Yudan Ding [1], Zhenzhen Qu [1], Chenchen Zhang [1], Arnold Kamis [1]
[1] International Business School, Brandeis University, Waltham, MA 02453, USA
Keywords: COVID-19; pandemic; regime; lockdowns; machine learning; ensemble model
DOI: 10.3390/app112311227
Source: DOAJ
[Abstract]
This paper models COVID-19 cases in the United States from 13 March 2020 to 31 May 2020. Our novel contribution is a set of highly accurate models for two distinct regimes, lockdown and reopen, with each regime modeled separately. The predictor variables include aggregated individual movement as well as state population density, health rank, climate temperature, and political color. We apply a variety of machine learning methods to each regime: Multiple Regression, Ridge Regression, Elastic Net Regression, Generalized Additive Model, Gradient Boosted Machine, Regression Tree, Neural Network, and Random Forest. We find that Gradient Boosted Machines are the most accurate in both regimes. The best models explain 95.2% of the variance in the lockdown regime and 99.2% in the reopen regime. We describe how the influence of the predictor variables changes from regime to regime. Notably, we identify individual movement, as tracked by GPS data, as an important predictor variable. We conclude that government lockdowns are an extremely important de-densification strategy. Implications and questions for future research are discussed.
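To make the idea of modeling each regime separately and reporting variance explained concrete, here is a minimal sketch in Python. It is not the authors' pipeline: it swaps in a one-predictor least-squares line for the Gradient Boosted Machine, and the rows, the `movement_index` column, and all function names are made up for illustration. Variance explained is computed as R^2 = 1 - SS_res / SS_tot, evaluated in-sample.

```python
from statistics import fmean


def fit_line(x, y):
    """Ordinary least squares with one predictor; returns (slope, intercept)."""
    mx, my = fmean(x), fmean(y)
    sxx = sum((xi - mx) ** 2 for xi in x)
    sxy = sum((xi - mx) * (yi - my) for xi, yi in zip(x, y))
    slope = sxy / sxx
    return slope, my - slope * mx


def variance_explained(y, y_hat):
    """R^2 = 1 - SS_res / SS_tot."""
    my = fmean(y)
    ss_res = sum((yi - hi) ** 2 for yi, hi in zip(y, y_hat))
    ss_tot = sum((yi - my) ** 2 for yi in y)
    return 1.0 - ss_res / ss_tot


# Synthetic, made-up rows (NOT data from the paper):
# (regime, movement_index, cases)
rows = [
    ("lockdown", 0.2, 110.0), ("lockdown", 0.3, 148.0),
    ("lockdown", 0.4, 205.0), ("lockdown", 0.5, 243.0),
    ("reopen", 0.6, 410.0), ("reopen", 0.7, 455.0),
    ("reopen", 0.8, 520.0), ("reopen", 0.9, 560.0),
]


def fit_by_regime(rows):
    """Fit one model per regime and report its in-sample variance explained."""
    results = {}
    for regime in sorted({r[0] for r in rows}):
        x = [r[1] for r in rows if r[0] == regime]
        y = [r[2] for r in rows if r[0] == regime]
        slope, intercept = fit_line(x, y)
        y_hat = [slope * xi + intercept for xi in x]
        results[regime] = {
            "slope": slope,
            "intercept": intercept,
            "r2": variance_explained(y, y_hat),
        }
    return results


results = fit_by_regime(rows)
for regime, res in results.items():
    print(regime, round(res["r2"], 3))
```

Fitting per regime lets the coefficients differ between lockdown and reopen, which is what allows the influence of a predictor to be compared across regimes. A faithful comparison of methods would also need held-out data rather than the in-sample score used here.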
[License]
Unknown