期刊论文详细信息
BioData Mining
Application of a spatially-weighted Relief algorithm for ranking genetic predictors of disease
Matthew E Stokes1  Shyam Visweswaran1 
[1] Department of Biomedical Informatics and the Intelligent Systems Program, University of Pittsburgh, 5607 Baum Boulevard, Pittsburgh, PA, 15206, USA
关键词: Epistasis;    Genetic interactions;    Single nucleotide polymorphisms;    Relief;    Feature selection;   
Others  :  797226
DOI  :  10.1186/1756-0381-5-20
 received in 2011-12-07, accepted in 2012-10-15,  发布年份 2012
PDF
【 摘 要 】

Background

Identification of genetic variants that are associated with disease is an important goal in elucidating the genetic causes of diseases. The genetic patterns that are associated with common diseases are complex and may involve multiple interacting genetic variants. The Relief family of algorithms is a powerful tool for efficiently identifying genetic variants that are associated with disease, even if the variants have nonlinear interactions without significant main effects. Many variations of Relief have been developed over the past two decades and several of them have been applied to single nucleotide polymorphism (SNP) data.

Results

We developed a new spatially weighted variation of Relief called Sigmoid Weighted ReliefF Star (SWRF*), and applied it to synthetic SNP data. When compared to ReliefF and SURF*, which are two algorithms that have been applied to SNP data for identifying interactions, SWRF* had significantly greater power. Furthermore, we developed a framework called the Modular Relief Framework (MoRF) that can be used to develop novel variations of the Relief algorithm, and we used MoRF to develop the SWRF* algorithm.

Conclusions

MoRF allows easy development of new Relief algorithms by specifying different interchangeable functions for the component terms. Using MORF, we developed a new Relief algorithm called SWRF* that had greater ability to identify interacting genetic variants in synthetic data compared to existing Relief algorithms.

【 授权许可】

   
2012 Stokes and Visweswaran.; licensee BioMed Central Ltd.

【 预 览 】
附件列表
Files Size Format View
20140706044018889.pdf 565KB PDF download
Figure 2. 107KB Image download
Figure 1. 21KB Image download
【 图 表 】

Figure 1.

Figure 2.

【 参考文献 】
  • [1]Phuong T, Lin Z, Altman R: Choosing SNPs using feature selection. Proceedings of the IEEE Computational Systems Bioinformatics Conference 2005, 301-309.
  • [2]De Lobel L, Geurts P, Baele G, Castro-Giner F, Kogevinas M, Van Steen K: A screening methodology based on random forests to improve the detection of gene-gene interactions. European Journal of Human Genetics 2010, 18:1127-1132.
  • [3]Kira K, Rendell L: A practical approach to feature selection. In ML92: Proceedings of the ninth international workshop on Machine learning. Morgan Kaufmann Publishers Inc; 1992:249-256.
  • [4]Dietterich TG: Machine learning research: four current directions. AI Magazine 1997, 18:97-136.
  • [5]Robnik-Šikonja M, Kononenko I: Theoretical and empirical analysis of ReliefF and RReliefF. Machine Learning 2003, 53:23-69.
  • [6]Kononenko I: Estimating attributes: analysis and extensions of RELIEF. In Proceedings of the European conference on machine learning on Machine Learning; Catania, Italy. Catania: Springer-Verlag New York, Inc; 1994:171-182.
  • [7]Greene CS, Penrod NM, Kiralis J, Moore JH: Spatially uniform ReliefF (SURF) for computationally-efficient filtering of gene-gene interactions. BioData Mining 2009, 2:5. BioMed Central Full Text
  • [8]Greene C, Himmelstein D, Kiralis J, Moore J: The Informative Extremes: Using Both Nearest and Farthest Individuals Can Improve Relief Algorithms in the Domain of Human Genetics. In Evolutionary Computation, Machine Learning and Data Mining in Bioinformatics. 6023 edition. Edited by Pizzuti C, Ritchie M, Giacobini M. Heidelberg: Springer Berlin; 2010:182-193. Lecture Notes in Computer Science
  • [9]Sun Y: Iterative RELIEF for feature weighting: algorithms, theories, and applications. IEEE Transactions on Pattern Analysis and Machine Intelligence 2007, 29:1035-1051.
  • [10]Moore J, White B: Tuning ReliefF for Genome-Wide Genetic Analysis. In Evolutionary Computation,Machine Learning and Data Mining in Bioinformatics. 4447 edition. Edited by Marchiori E, Moore J, Rajapakse J. Heidelberg: Springer Berlin; 2007:166-175.
  • [11]Wettschereck D, Aha DW, Mohri T: A review and empirical evaluation of feature weighting methods for a class of lazy learning algorithms. Artificial Intelligence Review 1997, 11:273-314.
  • [12]Draper B, Kaito C, Bins J: Iterative Relief. Proceedings of 2003 Conference on Computer Vision and Pattern Recognition Workshop 2003, 6:62-67.
  文献评价指标  
  下载次数:17次 浏览次数:12次