期刊论文详细信息
BMC Bioinformatics
Model-based clustering for flow and mass cytometry data with clinical information
Yuka Maeda1  Hiroyoshi Nishikawa2  Teppei Shimamura3  Ko Abe3  Kodai Minoura4 
[1] Division of Cancer Immunology, Research Institute/EPOC, National Cancer Center, Chuo-ku tsukiji 5-1-1/Kashiwa-shi kashiwanoha 6-5-1, 1040045/2778577, Tokyo/Chiba, Japan;Division of Immunology, Nagoya University Graduate School of Medicine, 65 Tsurumai-cho, Showa-ku, 4668550, Nagoya, Japan;Division of Cancer Immunology, Research Institute/EPOC, National Cancer Center, Chuo-ku tsukiji 5-1-1/Kashiwa-shi kashiwanoha 6-5-1, 1040045/2778577, Tokyo/Chiba, Japan;Division of Systems Biology, Nagoya University Graduate School of Medicine, 65 Tsurumai-cho, Showa-ku, 4668550, Nagoya, Japan;Division of Systems Biology, Nagoya University Graduate School of Medicine, 65 Tsurumai-cho, Showa-ku, 4668550, Nagoya, Japan;Division of Immunology, Nagoya University Graduate School of Medicine, 65 Tsurumai-cho, Showa-ku, 4668550, Nagoya, Japan;
关键词: Flow cytomety;    Mass cytometory;    Bayesian mixture model;    Stochastic EM algorithm;   
DOI  :  10.1186/s12859-020-03671-7
来源: Springer
PDF
【 摘 要 】

BackgroundHigh-dimensional flow cytometry and mass cytometry allow systemic-level characterization of more than 10 protein profiles at single-cell resolution and provide a much broader landscape in many biological applications, such as disease diagnosis and prediction of clinical outcome. When associating clinical information with cytometry data, traditional approaches require two distinct steps for identification of cell populations and statistical test to determine whether the difference between two population proportions is significant. These two-step approaches can lead to information loss and analysis bias.ResultsWe propose a novel statistical framework, called LAMBDA (Latent Allocation Model with Bayesian Data Analysis), for simultaneous identification of unknown cell populations and discovery of associations between these populations and clinical information. LAMBDA uses specified probabilistic models designed for modeling the different distribution information for flow or mass cytometry data, respectively. We use a zero-inflated distribution for the mass cytometry data based the characteristics of the data. A simulation study confirms the usefulness of this model by evaluating the accuracy of the estimated parameters. We also demonstrate that LAMBDA can identify associations between cell populations and their clinical outcomes by analyzing real data. LAMBDA is implemented in R and is available from GitHub (https://github.com/abikoushi/lambda).

【 授权许可】

CC BY   

【 预 览 】
附件列表
Files Size Format View
RO202104241628008ZK.pdf 2260KB PDF download
  文献评价指标  
  下载次数:13次 浏览次数:9次