会议论文详细信息
5th International Conference on Mathematical Modeling in Physical Sciences
Statistical classification techniques in high energy physics (SDDT algorithm)
物理学;数学
Bou, Petr^1 ; Ks, Václav^1 ; Franc, Jií^1
Department of Mathematics, Faculty of Nuclear Sciences and Physical Engineering, Czech Technical University in Prague, Trojanova 13, Praha 2
12000, Czech Republic^1
关键词: Different class;    Divergence measures;    Fermilab Tevatron collider;    Generalized linear model;    Integrated luminosity;    Physical variables;    Separation methods;    Statistical classification techniques;   
Others  :  https://iopscience.iop.org/article/10.1088/1742-6596/738/1/012034/pdf
DOI  :  10.1088/1742-6596/738/1/012034
来源: IOP
PDF
【 摘 要 】

We present our proposal of the supervised binary divergence decision tree with nested separation method based on the generalized linear models. A key insight we provide is the clustering driven only by a few selected physical variables. The proper selection consists of the variables achieving the maximal divergence measure between two different classes. Further, we apply our method to Monte Carlo simulations of physics processes corresponding to a data sample of top quark-antiquark pair candidate events in the lepton+jets decay channel. The data sample is produced in pp collisions at √S = 1.96 TeV. It corresponds to an integrated luminosity of 9.7 fb-1recorded with the D0 detector during Run II of the Fermilab Tevatron Collider. The efficiency of our algorithm achieves 90% AUC in separating signal from background. We also briefly deal with the modification of statistical tests applicable to weighted data sets in order to test homogeneity of the Monte Carlo simulations and measured data. The justification of these modified tests is proposed through the divergence tests.

【 预 览 】
附件列表
Files Size Format View
Statistical classification techniques in high energy physics (SDDT algorithm) 661KB PDF download
  文献评价指标  
  下载次数:18次 浏览次数:22次