期刊论文详细信息
BMC Bioinformatics
CLAG: an unsupervised non hierarchical clustering algorithm handling biological data
Research Article
Alessandra Carbone1  Linda Dib1 
[1] UPMC, UMR7238, Génomique Analytique, 15 rue de l’Ecole de Médecine, F-75006, Paris, France;CNRS, UMR7238, Laboratoire de Génomique des Microorganismes, F-75006, Paris, France;
关键词: Hierarchical Agglomerative Cluster;    Affinity Propagation;    Hierarchical Cluster Algorithm;    Cluster Step;    Breast Cancer Dataset;   
DOI  :  10.1186/1471-2105-13-194
 received in 2012-01-18, accepted in 2012-07-23,  发布年份 2012
来源: Springer
PDF
【 摘 要 】

BackgroundSearching for similarities in a set of biological data is intrinsically difficult due to possible data points that should not be clustered, or that should group within several clusters. Under these hypotheses, hierarchical agglomerative clustering is not appropriate. Moreover, if the dataset is not known enough, like often is the case, supervised classification is not appropriate either.ResultsCLAG (for CLusters AGgregation) is an unsupervised non hierarchical clustering algorithm designed to cluster a large variety of biological data and to provide a clustered matrix and numerical values indicating cluster strength. CLAG clusterizes correlation matrices for residues in protein families, gene-expression and miRNA data related to various cancer types, sets of species described by multidimensional vectors of characters, binary matrices. It does not ask to all data points to cluster and it converges yielding the same result at each run. Its simplicity and speed allows it to run on reasonably large datasets.ConclusionsCLAG can be used to investigate the cluster structure present in biological datasets and to identify its underlying graph. It showed to be more informative and accurate than several known clustering methods, as hierarchical agglomerative clustering, k-means, fuzzy c-means, model-based clustering, affinity propagation clustering, and not to suffer of the convergence problem proper to this latter.

【 授权许可】

CC BY   
© Dib and Carbone; licensee BioMed Central Ltd. 2012

【 预 览 】
附件列表
Files Size Format View
RO202311094679920ZK.pdf 2217KB PDF download
【 参考文献 】
  • [1]
  • [2]
  • [3]
  • [4]
  • [5]
  • [6]
  • [7]
  • [8]
  • [9]
  • [10]
  • [11]
  • [12]
  • [13]
  • [14]
  • [15]
  • [16]
  • [17]
  • [18]
  • [19]
  • [20]
  • [21]
  • [22]
  • [23]
  • [24]
  • [25]
  • [26]
  • [27]
  • [28]
  • [29]
  • [30]
  • [31]
  • [32]
  • [33]
  • [34]
  文献评价指标  
  下载次数:5次 浏览次数:0次