期刊论文详细信息
BMC Bioinformatics
Data reduction for spectral clustering to analyze high throughput flow cytometry data
Methodology Article
Habil Zare1  Parisa Shooshtari1  Arvind Gupta2  Ryan R Brinkman3 
[1]Department of Computing Science, University of British Columbia, Vancouver, BC, Canada
[2]Terry Fox Laboratory, BC Cancer Agency, 675 W 10th Ave, Vancouver, BC, Canada
[3]Faculty of Science, University of British Columbia, Vancouver, BC, Canada
[4]Terry Fox Laboratory, BC Cancer Agency, 675 W 10th Ave, Vancouver, BC, Canada
[5]Department of Medical Genetics, University of British Columbia, Vancouver, BC, Canada
关键词: Telomere Length;    Graft Versus Host Disease;    Spectral Cluster;    Flow Cytometry Data;    Spectral Cluster Algorithm;   
DOI  :  10.1186/1471-2105-11-403
 received in 2009-12-21, accepted in 2010-07-28,  发布年份 2010
来源: Springer
PDF
【 摘 要 】
BackgroundRecent biological discoveries have shown that clustering large datasets is essential for better understanding biology in many areas. Spectral clustering in particular has proven to be a powerful tool amenable for many applications. However, it cannot be directly applied to large datasets due to time and memory limitations. To address this issue, we have modified spectral clustering by adding an information preserving sampling procedure and applying a post-processing stage. We call this entire algorithm SamSPECTRAL.ResultsWe tested our algorithm on flow cytometry data as an example of large, multidimensional data containing potentially hundreds of thousands of data points (i.e., "events" in flow cytometry, typically corresponding to cells). Compared to two state of the art model-based flow cytometry clustering methods, SamSPECTRAL demonstrates significant advantages in proper identification of populations with non-elliptical shapes, low density populations close to dense ones, minor subpopulations of a major population and rare populations.ConclusionsThis work is the first successful attempt to apply spectral methodology on flow cytometry data. An implementation of our algorithm as an R package is freely available through BioConductor.
【 授权许可】

Unknown   
© Zare et al; licensee BioMed Central Ltd. 2010. This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

【 预 览 】
附件列表
Files Size Format View
RO202311100778742ZK.pdf 6146KB PDF download
【 参考文献 】
  • [1]
  • [2]
  • [3]
  • [4]
  • [5]
  • [6]
  • [7]
  • [8]
  • [9]
  • [10]
  • [11]
  • [12]
  • [13]
  • [14]
  • [15]
  • [16]
  • [17]
  • [18]
  • [19]
  • [20]
  • [21]
  • [22]
  • [23]
  • [24]
  • [25]
  • [26]
  • [27]
  • [28]
  • [29]
  • [30]
  • [31]
  • [32]
  • [33]
  • [34]
  • [35]
  • [36]
  • [37]
  • [38]
  • [39]
  • [40]
  • [41]
  • [42]
  • [43]
  • [44]
  • [45]
  • [46]
  • [47]
  • [48]
  • [49]
  • [50]
  文献评价指标  
  下载次数:0次 浏览次数:0次