期刊论文详细信息
Journal of computational biology
Reconstructing Genotypes in Private Genomic Databases from Genetic Risk Scores
article
Brooks Paige1  James Bell1  Aurélien Bellet3  Adrià Gascón1  Daphne Ezer1 
[1] The Alan Turing Institute, United Kingdom;Department of Computer Science, University College London, United Kingdom;Inria, Parc Scientifique de la Haute Borne Park Plaza, Villeneuve d’Ascq;University of Warwick, United Kingdom;Department of Biology, University of York, United Kingdom
关键词: genetic risk scores;    genomic privacy;    GWAS;    long-term privacy;    reconstruction attack.;   
DOI  :  10.1089/cmb.2020.0445
来源: Mary Ann Liebert, Inc. Publishers
PDF
【 摘 要 】

Some organizations such as 23andMe and the UK Biobank have large genomic databases that they re-use for multiple different genome-wide association studies. Even research studies that compile smaller genomic databases often utilize these databases to investigate many related traits. It is common for the study to report a genetic risk score (GRS) model for each trait within the publication. Here, we show that under some circumstances, these GRS models can be used to recover the genetic variants of individuals in these genomic databases—a reconstruction attack. In particular, if two GRS models are trained by using a largely overlapping set of participants, it is often possible to determine the genotype for each of the individuals who were used to train one GRS model, but not the other. We demonstrate this theoretically and experimentally by analyzing the Cornell Dog Genome database. The accuracy of our reconstruction attack depends on how accurately we can estimate the rate of co-occurrence of pairs of single nucleotide polymorphisms within the private database, so if this aggregate information is ever released, it would drastically reduce the security of a private genomic database. Caution should be applied when using the same database for multiple analysis, especially when a small number of individuals are included or excluded from one part of the study.

【 授权许可】

Unknown   

【 预 览 】
附件列表
Files Size Format View
RO202108110003438ZK.pdf 22282KB PDF download
  文献评价指标  
  下载次数:15次 浏览次数:0次