Journal article details
BMC Genomics, Volume 22
A systematic bioinformatics approach for large-scale identification and characterization of host-pathogen shared sequences
Methodology
Hui San Ong [1], Ranjeev Hari [1], Asif M. Khan [2], Stephen Among James [3]
[1] Centre for Bioinformatics, School of Data Sciences, Perdana University, 50490, Damansara Heights, Kuala Lumpur, Malaysia
[2] Centre for Bioinformatics, School of Data Sciences, Perdana University, 50490, Damansara Heights, Kuala Lumpur, Malaysia; Beykoz Institute of Life Sciences and Biotechnology, Bezmialem Vakif University, Beykoz, 34820, Istanbul, Turkey
[3] Centre for Bioinformatics, School of Data Sciences, Perdana University, 50490, Damansara Heights, Kuala Lumpur, Malaysia; Department of Biochemistry, Faculty of Science, Kaduna State University, 800211, Kaduna, Nigeria
Keywords: Shared sequences; Share-ome; Host-pathogen; Bioinformatics; Large-scale; Methodology; Flaviviridae; Flavivirus; Hepacivirus; Pegivirus; Pestivirus; Dengue virus; West Nile virus; Hepatitis C virus; Cross-reactivity; Crossreactome; Peptide sharing; Peptide overlap; Molecular mimicry
DOI: 10.1186/s12864-021-07657-4
Received 2021-03-14, accepted 2021-04-28, published 2021
Source: Springer
【 Abstract 】

Background: Biology has entered the era of big data with the advent of high-throughput omics technologies. Biological databases provide public access to petabytes of data and information, facilitating knowledge discovery. Over the years, the number of pathogen sequence records has grown substantially, given the relatively small genome size of pathogens and their important role as infectious and symbiotic agents. Humans are host to numerous pathogens, such as viruses, many of which are responsible for high mortality and morbidity. The interaction between pathogens and humans over evolutionary history has resulted in the sharing of sequences, with important biological and evolutionary implications.

Results: This study describes a large-scale, systematic bioinformatics approach for the identification and characterization of sequences shared between host and pathogen. An application of the approach is demonstrated through identification and characterization of the Flaviviridae-human share-ome. A total of 2430 nonamers represented the Flaviviridae-human share-ome with 100% identity. Although the share-ome represented a small fraction of the repertoire of Flaviviridae (~0.12%) and human (~0.013%) non-redundant nonamers, the 2430 shared nonamers mapped to 16,946 Flaviviridae and 7506 human non-redundant protein sequences. The shared nonamer sequences mapped to 125 species of Flaviviridae, including several of unclassified genus. The majority (~68%) of the shared sequences mapped to Hepacivirus C species; West Nile, dengue and Zika viruses of the Flavivirus genus accounted for ~11%, ~7%, and ~3%, respectively, of the Flaviviridae protein sequences (16,946) mapped by the share-ome. Further characterization of the share-ome provided important structural-functional insights into Flaviviridae-human interactions.

Conclusion: Mapping of the host-pathogen share-ome has important implications for the design of vaccines and drugs, diagnostics, disease surveillance and the discovery of unknown, potential host-pathogen interactions. The generic workflow presented herein is potentially applicable to a variety of pathogens, such as those of viral, bacterial or parasitic origin.
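
The core idea in the Results, nonamers shared between host and pathogen at 100% identity and then mapped back to the proteins that contain them, can be illustrated with a minimal Python sketch. This is not the authors' pipeline: it assumes that a shared nonamer is an exact 9-residue substring found by sliding an overlapping window over each protein, and the function names (`kmers`, `index_kmers`, `shared_kmers`) and the toy sequences are hypothetical, chosen only for illustration.

```python
from collections import defaultdict

K = 9  # nonamer length, as stated in the abstract


def kmers(sequence, k=K):
    """Yield every overlapping k-mer of a protein sequence."""
    for i in range(len(sequence) - k + 1):
        yield sequence[i:i + k]


def index_kmers(proteins, k=K):
    """Map each distinct k-mer to the set of protein IDs that contain it."""
    index = defaultdict(set)
    for protein_id, sequence in proteins.items():
        for kmer in kmers(sequence, k):
            index[kmer].add(protein_id)
    return index


def shared_kmers(pathogen, host, k=K):
    """Return {k-mer: (pathogen IDs, host IDs)} for exact (100% identity) matches."""
    pathogen_index = index_kmers(pathogen, k)
    host_index = index_kmers(host, k)
    common = pathogen_index.keys() & host_index.keys()  # set intersection of k-mers
    return {kmer: (pathogen_index[kmer], host_index[kmer]) for kmer in sorted(common)}


# Toy, made-up sequences (not real proteins), for illustration only.
pathogen_proteins = {"virus_A": "MKTAYIAKQRQISFVKSHF", "virus_B": "GGGSSSPPPLLLAAA"}
host_proteins = {"human_X": "AAAYIAKQRQISFWWW", "human_Y": "MNPQRSTVWYDE"}

share_ome = shared_kmers(pathogen_proteins, host_proteins)
for kmer, (pathogen_ids, host_ids) in share_ome.items():
    print(kmer, sorted(pathogen_ids), sorted(host_ids))
```

Indexing each side once and intersecting the key sets keeps the comparison proportional to the total number of nonamers, rather than comparing every pathogen protein against every host protein. Redundancy removal and the downstream structural-functional characterization mentioned in the abstract are outside the scope of this sketch.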

【 License 】

CC BY   
© The Author(s) 2021
