Journal article details
Journal of Biosciences
DNA pattern recognition using canonical correlation algorithm
B K Sarkar [2], Chiranjib Chakraborty [2, 1]
[1] Department of Bioinformatics, School of Computer Sciences, Galgotias University, Greater Noida, India
[2] Department of Physics, School of Basic & Applied Sciences, Galgotias University, Greater Noida, India
Keywords: Canonical correlation analysis; DNA sequence; pattern recognition
Source: Indian Academy of Sciences
【 Abstract 】

We used canonical correlation analysis (CCA) as an unsupervised statistical tool that describes related views of the same semantic object in order to identify patterns. A pattern recognition technique based on CCA was proposed for finding a required genetic code in a DNA sequence. Two related but different objects were considered: one was a particular pattern, and the other was a test DNA sequence. CCA found correlations between these two observations, the pattern and the test sequence. We conclude that the correlation reaches its maximum value at the position where the pattern occurs. As a case study, the potential of CCA was demonstrated on sequences from HIV-1 preferred integration sites. The subsequences flanking the integration site on the left and right were treated as the two views, and statistically significant relationships were established between them, indicating that viral preference is an important factor in the correlation.
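
The idea of scanning a test sequence for the position of maximum canonical correlation can be illustrated with a minimal Python sketch. It is not the authors' implementation: the one-hot encoding of nucleotides, the sliding window, the SVD-based computation, and all function names (`one_hot`, `first_canonical_correlation`, `scan`) are assumptions made for illustration. Each position within a window is treated as one observation, and the first canonical correlation between the encoded pattern and the encoded window is used as the score.

```python
import numpy as np

# Assumed encoding: each nucleotide becomes a 4-dimensional indicator vector.
NUCLEOTIDES = {"A": 0, "C": 1, "G": 2, "T": 3}


def one_hot(seq):
    """Encode a DNA string as a (len(seq), 4) indicator matrix."""
    out = np.zeros((len(seq), 4))
    for i, ch in enumerate(seq):
        out[i, NUCLEOTIDES[ch]] = 1.0
    return out


def orthonormal_basis(view, tol=1e-10):
    """Orthonormal basis of the column space of the column-centered view."""
    centered = view - view.mean(axis=0)
    u, s, _ = np.linalg.svd(centered, full_matrices=False)
    # One-hot columns are linearly dependent after centering,
    # so directions with negligible singular values are dropped.
    return u[:, s > tol]


def first_canonical_correlation(x, y):
    """Largest canonical correlation between two views with equal row counts."""
    qx, qy = orthonormal_basis(x), orthonormal_basis(y)
    if qx.shape[1] == 0 or qy.shape[1] == 0:
        return 0.0  # a constant view (e.g. "AAAA") carries no variance
    # Canonical correlations are the singular values of Qx^T Qy.
    s = np.linalg.svd(qx.T @ qy, compute_uv=False)
    return float(min(s[0], 1.0))  # clip floating-point overshoot


def scan(sequence, pattern):
    """Score every window of the test sequence against the pattern."""
    m = len(pattern)
    p = one_hot(pattern)
    return [
        first_canonical_correlation(p, one_hot(sequence[i:i + m]))
        for i in range(len(sequence) - m + 1)
    ]


if __name__ == "__main__":
    sequence = "ACGTTGCAAGCTTAGGCTAACG"  # toy data, not from the paper
    pattern = "AAGCTTAG"
    for i, score in enumerate(scan(sequence, pattern)):
        print(i, round(score, 3))
```

In this sketch the score equals 1 at any window identical to the pattern, which is the upper bound of a canonical correlation. Other windows can also reach 1 (for example, a window that is a consistent relabeling of the pattern's letters), so the maximum is not guaranteed to be unique under this encoding.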

【 License 】

Unknown   
