Genomics | |
Optimal choice of k-mer in composition vector method for genome sequence comparison | |
Subhram Das^11  Tamal Deb^22  Amira S. Ashour^43  Nilanjan Dey^34  D.K. Bhattacharya^55  D.N. Tibarewala^66  | |
[1] Department of Computer Science and Engineering, Narula Institute of Technology, Kolkata, India^1;Department of Computer Science, Barasat College, Kolkata, India^2;Department of Electronics and Electrical Communications Engineering, Faculty of Engineering, Tanta University, Egypt^4;Department of Information Technology, Techno India College of Technology, Kolkata, India^3;Department of Pure Mathematics, University of Calcutta, Kolkata, India^5;School of Bio-Science and Engineering, Jadavpur University, Kolkata, India^6 | |
关键词: k-mer; Alignment-free techniques; Alignment-based techniques; Composition vector method; Genome sequence comparison; | |
DOI : 10.1016/j.ygeno.2017.11.003 | |
学科分类:医学(综合) | |
来源: Academic Press | |
【 摘 要 】
Several proteins and genes are members of families that share a public evolutionary. In order to outline the evolutionary relationships and to recognize conserved patterns, sequence comparison becomes an emerging process. The current work investigates critically the k-mer role in composition vector method for comparing genome sequences. Generally, composition vector methods using k-mer are applied under choice of different value of k to compare genome sequences. For some values of k, results are satisfactory, but for other values of k, results are unsatisfactory. Standard composition vector method is carried out in the proposed work using 3-mer string length. In addition, special type of information based similarity index is used as a distance measure. It establishes that use of 3-mer and information based similarity index provide satisfactory results especially for comparison of whole genome sequences in all cases. These selections provide a sort of unified approach towards comparison of genome sequences.
【 授权许可】
CC BY
【 预 览 】
Files | Size | Format | View |
---|---|---|---|
RO201910181468228ZK.pdf | 2084KB | download |