Journal Article Details
BMC Bioinformatics
Walk-weighted subsequence kernels for protein-protein interaction extraction
Research Article
Juntae Yoon1  Seog Park2  Seonho Kim2  Jihoon Yang2 
[1] Daumsoft Inc, Se-Ah Venture Tower, Seoul, Korea; [2] Department of Computer Science, Sogang University, Seoul, Korea
Keywords: Short Path; Relation Extraction; Syntactic Information; Syntactic Relation; String Kernel
DOI: 10.1186/1471-2105-11-107
Received 2009-04-22, accepted 2010-02-25, published 2010
Source: Springer
【 Abstract 】

Background: The construction of interaction networks between proteins is central to understanding the underlying biological processes. However, because many useful relations are absent from databases and remain hidden in raw text, automatic extraction of interactions from text is an important problem in bioinformatics.

Results: We propose two kinds of kernel methods for genic interaction extraction that take the structural aspects of sentences into account. First, we improve our prior dependency kernel by modifying the kernel function so that it covers various substructures in terms of (1) e-walks, (2) partial matches, (3) non-contiguous paths, and (4) different significance of substructures. Second, we propose the walk-weighted subsequence kernel, which parameterizes non-contiguous syntactic structures as well as semantic roles and lexical features, and which makes learning structural aspects from a small amount of training data effective. Furthermore, we distinguish the significance of parameters such as syntactic locality, semantic roles, and lexical features by varying their weights.

Conclusions: We addressed the genic interaction problem with various dependency kernels and proposed several structural kernel scenarios based on the directed shortest dependency path connecting two entities. With the walk-weighted subsequence kernel, we obtained promising results on genic interaction data sets. The results are compared using automatically parsed third-party protein-protein interaction (PPI) data as well as PPI data with perfect syntactic labels.

【 License 】

CC BY   
© Kim et al; licensee BioMed Central Ltd. 2010
