期刊论文详细信息
BMC Medical Genomics
Predicting gene knockout effects from expression data
Research
Sagiv Shifman1  Jonathan Rosenski2  Tommy Kaplan3 
[1] Department of Genetics, The Institute of Life Sciences, The Hebrew University of Jerusalem, Jerusalem, Israel;School of Computer Science and Engineering, The Hebrew University of Jerusalem, Jerusalem, Israel;School of Computer Science and Engineering, The Hebrew University of Jerusalem, Jerusalem, Israel;Department of Developmental Biology and Cancer Research, Faculty of Medicine, The Hebrew University of Jerusalem, Jerusalem, Israel;
关键词: Gene essentiality;    Computational biology;    Machine learning;   
DOI  :  10.1186/s12920-023-01446-6
 received in 2022-08-29, accepted in 2023-01-27,  发布年份 2023
来源: Springer
PDF
【 摘 要 】

BackgroundThe study of gene essentiality, which measures the importance of a gene for cell division and survival, is used for the identification of cancer drug targets and understanding of tissue-specific manifestation of genetic conditions. In this work, we analyze essentiality and gene expression data from over 900 cancer lines from the DepMap project to create predictive models of gene essentiality.MethodsWe developed machine learning algorithms to identify those genes whose essentiality levels are explained by the expression of a small set of “modifier genes”. To identify these gene sets, we developed an ensemble of statistical tests capturing linear and non-linear dependencies. We trained several regression models predicting the essentiality of each target gene, and used an automated model selection procedure to identify the optimal model and hyperparameters. Overall, we examined linear models, gradient boosted trees, Gaussian process regression models, and deep learning networks.ResultsWe identified nearly 3000 genes for which we accurately predict essentiality using gene expression data of a small set of modifier genes. We show that both in the number of genes we successfully make predictions for, as well as in the prediction accuracy, our model outperforms current state-of-the-art works.ConclusionsOur modeling framework avoids overfitting by identifying the small set of modifier genes, which are of clinical and genetic importance, and ignores the expression of noisy and irrelevant genes. Doing so improves the accuracy of essentiality prediction in various conditions and provides interpretable models. Overall, we present an accurate computational approach, as well as interpretable modeling of essentiality in a wide range of cellular conditions, thus contributing to a better understanding of the molecular mechanisms that govern tissue-specific effects of genetic disease and cancer.

【 授权许可】

CC BY   
© The Author(s) 2023

【 预 览 】
附件列表
Files Size Format View
RO202305157124522ZK.pdf 1995KB PDF download
Fig. 3 101KB Image download
Fig. 1 161KB Image download
Fig. 2 2875KB Image download
Fig. 3 212KB Image download
Fig. 1 27KB Image download
MediaObjects/12888_2023_4572_MOESM1_ESM.docx 33KB Other download
【 图 表 】

Fig. 1

Fig. 3

Fig. 2

Fig. 1

Fig. 3

【 参考文献 】
  • [1]
  • [2]
  • [3]
  • [4]
  • [5]
  • [6]
  • [7]
  • [8]
  • [9]
  • [10]
  • [11]
  • [12]
  • [13]
  • [14]
  • [15]
  • [16]
  • [17]
  • [18]
  • [19]
  • [20]
  • [21]
  • [22]
  • [23]
  文献评价指标  
  下载次数:2次 浏览次数:0次