期刊论文详细信息
Genetics and Molecular Biology
Evaluation of noise reduction techniques in the splice junction recognition problem
Ana C. Lorena1  André C. P. L. F. De Carvalho1 
[1] ,Universidade de São Paulo Instituto de Ciências Matemáticas e de Computação Laboratório de Computação BioinspiradaSão Carlos São Paulo ,Brazil
关键词: pre-processing;    machine learning;    splice junction recognition;   
DOI  :  10.1590/S1415-47572004000400031
来源: SciELO
PDF
【 摘 要 】

The Human Genome Project has generated a large amount of sequence data. A number of works are currently concerned with analyzing these data. One of the analyses carried out is the identification of genes' structures on the sequences obtained. As such, one can search for particular signals associated with gene expression. Splice junctions represent a type of signal present on eukaryote genes. Many studies have applied Machine Learning techniques in the recognition of such regions. However, most of the genetic databases are characterized by the presence of noisy data, which can affect the performance of the learning techniques. This paper evaluates the effectiveness of five data pre-processing algorithms in the elimination of noisy instances from two splice junction recognition datasets. After the pre-processing phase, two learning techniques, Decision Trees and Support Vector Machines, are employed in the recognition process.

【 授权许可】

CC BY   
 All the contents of this journal, except where otherwise noted, is licensed under a Creative Commons Attribution License

【 预 览 】
附件列表
Files Size Format View
RO202005130147379ZK.pdf 513KB PDF download
  文献评价指标  
  下载次数:15次 浏览次数:10次