期刊论文详细信息
Environmental Health
Classification and regression trees for epidemiologic research: an air pollution example
Methodology
Howard H Chang1  Mitch Klein2  Matthew J Strickland2  Katherine Gass3  W Dana Flanders3 
[1]Department of Biostatistics and Bioinformatics, Rollins School of Public Health, Emory University, 1518 Clifton Rd, 30322, Atlanta, GA, USA
[2]Department of Environmental Health, Rollins School of Public Health, Emory University, 1518 Clifton Rd, 30322, Atlanta, GA, USA
[3]Department of Epidemiology, Rollins School of Public Health, Emory University, 1518 Clifton Rd, 30322, Atlanta, GA, USA
关键词: Air pollution;    CART;    Classification and regression trees;    Multipollutant;    Mixtures;    Pediatric asthma;   
DOI  :  10.1186/1476-069X-13-17
 received in 2013-11-12, accepted in 2014-03-07,  发布年份 2014
来源: Springer
PDF
【 摘 要 】
BackgroundIdentifying and characterizing how mixtures of exposures are associated with health endpoints is challenging. We demonstrate how classification and regression trees can be used to generate hypotheses regarding joint effects from exposure mixtures.MethodsWe illustrate the approach by investigating the joint effects of CO, NO2, O3, and PM2.5 on emergency department visits for pediatric asthma in Atlanta, Georgia. Pollutant concentrations were categorized as quartiles. Days when all pollutants were in the lowest quartile were held out as the referent group (n = 131) and the remaining 3,879 days were used to estimate the regression tree. Pollutants were parameterized as dichotomous variables representing each ordinal split of the quartiles (e.g. comparing CO quartile 1 vs. CO quartiles 2–4) and considered one at a time in a Poisson case-crossover model with control for confounding. The pollutant-split resulting in the smallest P- value was selected as the first split and the dataset was partitioned accordingly. This process repeated for each subset of the data until the P- values for the remaining splits were not below a given alpha, resulting in the formation of a “terminal node”. We used the case-crossover model to estimate the adjusted risk ratio for each terminal node compared to the referent group, as well as the likelihood ratio test for the inclusion of the terminal nodes in the final model.ResultsThe largest risk ratio corresponded to days when PM2.5 was in the highest quartile and NO2 was in the lowest two quartiles (RR: 1.10, 95% CI: 1.05, 1.16). A simultaneous Wald test for the inclusion of all terminal nodes in the model was significant, with a chi-square statistic of 34.3 (p = 0.001, with 13 degrees of freedom).ConclusionsRegression trees can be used to hypothesize about joint effects of exposure mixtures and may be particularly useful in the field of air pollution epidemiology for gaining a better understanding of complex multipollutant exposures.
【 授权许可】

CC BY   
© Gass et al.; licensee BioMed Central Ltd. 2014

【 预 览 】
附件列表
Files Size Format View
RO202311109044003ZK.pdf 627KB PDF download
【 参考文献 】
  • [1]
  • [2]
  • [3]
  • [4]
  • [5]
  • [6]
  • [7]
  • [8]
  • [9]
  • [10]
  • [11]
  • [12]
  • [13]
  • [14]
  • [15]
  • [16]
  • [17]
  • [18]
  • [19]
  • [20]
  • [21]
  • [22]
  • [23]
  • [24]
  • [25]
  • [26]
  • [27]
  • [28]
  • [29]
  • [30]
  • [31]
  • [32]
  • [33]
  • [34]
  • [35]
  • [36]
  文献评价指标  
  下载次数:2次 浏览次数:1次