Journal article details
BMC Bioinformatics
optimalFlow: optimal transport approach to flow cytometry gating and population matching
Hristo Inouzhe [1], Agustín Mayo-Íscar [1], Eustasio del Barrio [1], Carlos Matrán [1], Jean-Michel Loubes [2]
[1] Departamento de Estadística e Investigación Operativa, Universidad de Valladolid, Calle Paseo de Belén, Valladolid, Spain; IMUVA, Calle Paseo de Belén, Valladolid, Spain; Université Paul Sabatier, Route de Narbonne, Toulouse, France; IMT, Route de Narbonne, Toulouse, France
Keywords: Flow cytometry gating; Optimal transport; Wasserstein distance; Clustering; Supervised classification
DOI: 10.1186/s12859-020-03795-w
Source: Springer
【 Abstract 】

Background: Data obtained from flow cytometry present pronounced variability due to biological and technical reasons. Biological variability is a well-known phenomenon produced by measurements on different individuals, with different characteristics such as illness, age, sex, etc. The use of different settings for measurement, the variation of the conditions during experiments and the different types of flow cytometers are some of the technical causes of variability. This mixture of sources of variability makes the use of supervised machine learning for identification of cell populations difficult. The present work is conceived as a combination of strategies to facilitate the task of supervised gating.

Results: We propose optimalFlowTemplates, based on a similarity distance and Wasserstein barycenters, which clusters cytometries and produces prototype cytometries for the different groups. We show that supervised learning, restricted to the new groups, performs better than the same techniques applied to the whole collection. We also present optimalFlowClassification, which uses a database of gated cytometries and optimalFlowTemplates to assign cell types to a new cytometry. We show that this procedure can outperform state-of-the-art techniques in the proposed datasets. Our code is freely available as optimalFlow, a Bioconductor R package at https://bioconductor.org/packages/optimalFlow.

Conclusions: optimalFlowTemplates + optimalFlowClassification addresses the problem of using supervised learning while accounting for biological and technical variability. Our methodology provides a robust automated gating workflow that handles the intrinsic variability of flow cytometry data well. Our main innovation is the methodology itself and the optimal transport techniques that we apply to flow cytometry analysis.

【 License 】

CC BY   
