期刊论文详细信息
BMC Genomics
Staged heterogeneity learning to identify conformational B-cell epitopes from antigen sequences
Research
Jinyan Li1  Jing Ren2  Jiangning Song3  John Ellis4 
[1] Advanced Analytics Institute and Centre for Health Technologies, Faculty of Engineering and Information Technology, University of Technology Sydney, NSW 2007, Ultimo, Australia;Advanced Analytics Institute, Faculty of Engineering and Information Technology, University of Technology Sydney, NSW 2007, Ultimo, Australia;College of Computer, National University of Defense Technology, 410073, Changsha, China;Monash Centre for Data Science, Faculty of Information Technology, Monash University, VIC 3800, Melbourne, Australia;Infection and Immunity Program, Biomedicine Discovery Institute, Monash University, VIC 3800, Melbourne, Australia;School of Life Sciences, University of Technology Sydney, NSW 2007, Ultimo, Australia;
关键词: Staged heterogeneity learning;    Conformational epitope;    B-cell epitope;    Epitope prediction;    Sequence-based;   
DOI  :  10.1186/s12864-017-3493-0
来源: Springer
PDF
【 摘 要 】

BackgroundThe broad heterogeneity of antigen-antibody interactions brings tremendous challenges to the design of a widely applicable learning algorithm to identify conformational B-cell epitopes. Besides the intrinsic heterogeneity introduced by diverse species, extra heterogeneity can also be introduced by various data sources, adding another layer of complexity and further confounding the research.ResultsThis work proposed a staged heterogeneity learning method, which learns both characteristics and heterogeneity of data in a phased manner. The method was applied to identify antigenic residues of heterogenous conformational B-cell epitopes based on antigen sequences. In the first stage, the model learns the general epitope patterns of each kind of propensity from a large data set containing computationally defined epitopes. In the second stage, the model learns the heterogenous complementarity of these propensities from a relatively small guided data set containing experimentally determined epitopes. Moreover, we designed an algorithm to cluster the predicted individual antigenic residues into conformational B-cell epitopes so as to provide strong potential for real-world applications, such as vaccine development. With heterogeneity well learnt, the transferability of the prediction model was remarkably improved to handle new data with a high level of heterogeneity. The model has been tested on two data sets with experimentally determined epitopes, and on a data set with computationally defined epitopes. This proposed sequence-based method achieved outstanding performance - about twice that of existing methods, including the sequence-based predictor CBTOPE and three other structure-based predictors.ConclusionsThe proposed method uses only antigen sequence information, and thus has much broader applications.

【 授权许可】

CC BY   
© The Author(s) 2017

【 预 览 】
附件列表
Files Size Format View
RO202311101235852ZK.pdf 2746KB PDF download
【 参考文献 】
  • [1]
  • [2]
  • [3]
  • [4]
  • [5]
  • [6]
  • [7]
  • [8]
  • [9]
  • [10]
  • [11]
  • [12]
  • [13]
  • [14]
  • [15]
  • [16]
  • [17]
  • [18]
  • [19]
  • [20]
  • [21]
  • [22]
  • [23]
  • [24]
  • [25]
  • [26]
  • [27]
  • [28]
  • [29]
  • [30]
  文献评价指标  
  下载次数:4次 浏览次数:1次