期刊论文详细信息
BMC Public Health
A systematic review of data mining and machine learning for air pollution epidemiology
Research Article
Mohomed Shazan Mohomed Jabbar1  Osmar Zaïane1  Colin Bellinger1  Alvaro Osornio-Vargas2 
[1] Department of Computing Science, University of Alberta, Edmonton, Canada;Department of Paediatrics, University of Alberta, Edmonto, Canada;
关键词: Epidemiology;    Air pollution;    Exposure;    Data mining;    Big data;    Machine learning;    Association mining;   
DOI  :  10.1186/s12889-017-4914-3
 received in 2017-04-26, accepted in 2017-11-14,  发布年份 2017
来源: Springer
PDF
【 摘 要 】

BackgroundData measuring airborne pollutants, public health and environmental factors are increasingly being stored and merged. These big datasets offer great potential, but also challenge traditional epidemiological methods. This has motivated the exploration of alternative methods to make predictions, find patterns and extract information. To this end, data mining and machine learning algorithms are increasingly being applied to air pollution epidemiology.MethodsWe conducted a systematic literature review on the application of data mining and machine learning methods in air pollution epidemiology. We carried out our search process in PubMed, the MEDLINE database and Google Scholar. Research articles applying data mining and machine learning methods to air pollution epidemiology were queried and reviewed.ResultsOur search queries resulted in 400 research articles. Our fine-grained analysis employed our inclusion/exclusion criteria to reduce the results to 47 articles, which we separate into three primary areas of interest: 1) source apportionment; 2) forecasting/prediction of air pollution/quality or exposure; and 3) generating hypotheses. Early applications had a preference for artificial neural networks. In more recent work, decision trees, support vector machines, k-means clustering and the APRIORI algorithm have been widely applied. Our survey shows that the majority of the research has been conducted in Europe, China and the USA, and that data mining is becoming an increasingly common tool in environmental health. For potential new directions, we have identified that deep learning and geo-spacial pattern mining are two burgeoning areas of data mining that have good potential for future applications in air pollution epidemiology.ConclusionsWe carried out a systematic review identifying the current trends, challenges and new directions to explore in the application of data mining methods to air pollution epidemiology. This work shows that data mining is increasingly being applied in air pollution epidemiology.The potential to support air pollution epidemiology continues to grow with advancements in data mining related to temporal and geo-spacial mining, and deep learning. This is further supported by new sensors and storage mediums that enable larger, better quality data. This suggests that many more fruitful applications can be expected in the future.

【 授权许可】

CC BY   
© The Author(s) 2017

【 预 览 】
附件列表
Files Size Format View
RO202311096081645ZK.pdf 896KB PDF download
12864_2015_1944_Article_IEq6.gif 1KB Image download
12864_2017_3492_Article_IEq3.gif 1KB Image download
12864_2015_1944_Article_IEq8.gif 1KB Image download
12864_2016_2580_Article_IEq3.gif 1KB Image download
12864_2017_3920_Article_IEq4.gif 1KB Image download
12711_2017_331_Article_IEq65.gif 1KB Image download
【 图 表 】

12711_2017_331_Article_IEq65.gif

12864_2017_3920_Article_IEq4.gif

12864_2016_2580_Article_IEq3.gif

12864_2015_1944_Article_IEq8.gif

12864_2017_3492_Article_IEq3.gif

12864_2015_1944_Article_IEq6.gif

【 参考文献 】
  • [1]
  • [2]
  • [3]
  • [4]
  • [5]
  • [6]
  • [7]
  • [8]
  • [9]
  • [10]
  • [11]
  • [12]
  • [13]
  • [14]
  • [15]
  • [16]
  • [17]
  • [18]
  • [19]
  • [20]
  • [21]
  • [22]
  • [23]
  • [24]
  • [25]
  • [26]
  • [27]
  • [28]
  • [29]
  • [30]
  • [31]
  • [32]
  • [33]
  • [34]
  • [35]
  • [36]
  • [37]
  • [38]
  • [39]
  • [40]
  • [41]
  • [42]
  • [43]
  • [44]
  • [45]
  • [46]
  • [47]
  • [48]
  • [49]
  • [50]
  • [51]
  • [52]
  • [53]
  • [54]
  • [55]
  • [56]
  • [57]
  • [58]
  • [59]
  • [60]
  • [61]
  • [62]
  • [63]
  • [64]
  • [65]
  • [66]
  • [67]
  • [68]
  • [69]
  文献评价指标  
  下载次数:16次 浏览次数:4次