期刊论文详细信息
Journal of Computational Biology
The Complexity of the Dirichlet Model for Multiple Alignment Data
Yi-Kuo Yu1  Stephen F. Altschul1 
[1] National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, Maryland.
DOI  :  10.1089/cmb.2011.0039
学科分类:生物科学(综合)
来源: Mary Ann Liebert, Inc. Publishers
PDF
【 摘 要 】

Abstract A model is a set of possible theories for describing a set of data. When the data are used to select a maximum-likelihood theory, an important question is how many effectively independent theories the model contains; the log of this number is called the model's complexity. The Dirichlet model is the set of all Dirichlet distributions, which are probability densities over the space of multinomials. A Dirichlet distribution may be used to describe multiple-alignment data, consisting of n columns of letters, with c letters in each column. We here derive, in the limit of large n and c, a closed-form expression for the complexity of the Dirichlet model applied to such data. For small c, we derive as well a minor correction to this formula, which is easily calculated by Monte Carlo simulation. Although our results are confined to the Dirichlet model, they may cast light as well on the complexity of Dirichlet mixture models, which have been applied fruitfully to the study of protein multiple sequence al..." /> -->

【 授权许可】

Unknown   

【 预 览 】
附件列表
Files Size Format View
RO201912050577552ZK.pdf 363KB PDF download
  文献评价指标  
  下载次数:1次 浏览次数:1次