期刊论文详细信息
BMC Bioinformatics
Identifying structural domains of proteins using clustering
Methodology Article
Howard J Feldman1 
[1] Chemical Computing Group, Inc., 1010 Sherbrooke St. W., Suite 910, H3A 2R7, Montreal, Quebec, Canada;
关键词: Domain assignment;    Agglomerative clustering;    Average-linkage;    Structural domain;   
DOI  :  10.1186/1471-2105-13-286
 received in 2012-01-26, accepted in 2012-10-29,  发布年份 2012
来源: Springer
PDF
【 摘 要 】

BackgroundProtein structures are comprised of modular elements known as domains. These units are used and re-used over and over in nature, and usually serve some particular function in the structure. Thus it is useful to be able to break up a protein of interest into its component domains, prior to similarity searching for example. Numerous computational methods exist for doing so, but most operate only on a single protein chain and many are limited to making a series of cuts to the sequence, while domains can and do span multiple chains.ResultsThis study presents a novel clustering-based approach to domain identification, which works equally well on individual chains or entire complexes. The method is simple and fast, taking only a few milliseconds to run, and works by clustering either vectors representing secondary structure elements, or buried alpha-carbon positions, using average-linkage clustering. Each resulting cluster corresponds to a domain of the structure. The method is competitive with others, achieving 70% agreement with SCOP on a large non-redundant data set, and 80% on a set more heavily weighted in multi-domain proteins on which both SCOP and CATH agree.ConclusionsIt is encouraging that a basic method such as this performs nearly as well or better than some far more complex approaches. This suggests that protein domains are indeed for the most part simply compact regions of structure with a higher density of buried contacts within themselves than between each other. By representing the structure as a set of points or vectors in space, it allows us to break free of any artificial limitations that other approaches may depend upon.

【 授权许可】

CC BY   
© Feldman; licensee BioMed Central Ltd. 2012

【 预 览 】
附件列表
Files Size Format View
RO202311098987104ZK.pdf 1213KB PDF download
【 参考文献 】
  • [1]
  • [2]
  • [3]
  • [4]
  • [5]
  • [6]
  • [7]
  • [8]
  • [9]
  • [10]
  • [11]
  • [12]
  • [13]
  • [14]
  • [15]
  • [16]
  • [17]
  • [18]
  • [19]
  • [20]
  • [21]
  • [22]
  • [23]
  • [24]
  • [25]
  • [26]
  文献评价指标  
  下载次数:6次 浏览次数:2次