期刊论文详细信息
Journal of Computer Science
Taxonomically Clustering Organisms Based on the Profiles of Gene Sequences Using PCA | Science Publications
M. Punithavalli1  E. Ramaraj1 
关键词: Bioinformatics;    taxonomy;    gene sequence classification;    data mining;    data classification;    clustering;    principal component analysis;   
DOI  :  10.3844/jcssp.2006.292.296
学科分类:计算机科学(综合)
来源: Science Publications
PDF
【 摘 要 】

The biological implications of bioinformatics can already be seen in various implementations. Biological taxonomy may seem like a simple science in which the biologists merely observe similarities among organisms and construct classifications according to those similarities[1], but it is not so simple. By applying data mining techniques on gene sequence database we can cluster the data to find interesting similarities in the gene expression data. One of the applications of such kind of clustering is taxonomically clustering the organisms based on their gene sequential expressions. In this study we outlined a method for taxonomical clustering of species of the organisms based on the genetic profile using Principal Component Analysis and Self Organizing Neural Networks. We have implemented the idea using Matlab and tried to cluster the gene sequences taken from PAUP version of the ML5/ML6 database. The taxa used for some of the basidiomycetous fungi form the database. To study the scalability issues another large gene sequence database was used. The proposed method clustered the species of organisms correctly in almost all the cases. The obtained were more significant and promising. The proposed method clustered the species of organisms correctly in almost all the cases. The obtained results were more significant and promising.

【 授权许可】

Unknown   

【 预 览 】
附件列表
Files Size Format View
RO201911300818157ZK.pdf 67KB PDF download
  文献评价指标  
  下载次数:15次 浏览次数:16次