期刊论文详细信息
Frontiers in Molecular Biosciences
Understanding structure-guided variant effect predictions using 3D convolutional neural networks
Molecular Biosciences
Bas Vroling1  Stephan Heijl1  Gayatri Ramakrishnan2  Martijn A. Huynen2  Coos Baakman2  Li C. Xue2  Ragna van Horck3  Jeffrey Hiraki3 
[1] Bio-Prodict, Nijmegen, Netherlands;Department of Medical Biosciences, Radboud University Medical Center, Nijmegen, Netherlands;Vartion, Malden, Netherlands;
关键词: protein structure;    3D CNN;    missense variant;    machine learning;    gain-of-function;    loss-of-function;   
DOI  :  10.3389/fmolb.2023.1204157
 received in 2023-04-11, accepted in 2023-06-22,  发布年份 2023
来源: Frontiers
PDF
【 摘 要 】

Predicting pathogenicity of missense variants in molecular diagnostics remains a challenge despite the available wealth of data, such as evolutionary information, and the wealth of tools to integrate that data. We describe DeepRank-Mut, a configurable framework designed to extract and learn from physicochemically relevant features of amino acids surrounding missense variants in 3D space. For each variant, various atomic and residue-level features are extracted from its structural environment, including sequence conservation scores of the surrounding amino acids, and stored in multi-channel 3D voxel grids which are then used to train a 3D convolutional neural network (3D-CNN). The resultant model gives a probabilistic estimate of whether a given input variant is disease-causing or benign. We find that the performance of our 3D-CNN model, on independent test datasets, is comparable to other widely used resources which also combine sequence and structural features. Based on the 10-fold cross-validation experiments, we achieve an average accuracy of 0.77 on the independent test datasets. We discuss the contribution of the variant neighborhood in the model’s predictive power, in addition to the impact of individual features on the model’s performance. Two key features: evolutionary information of residues in the variant neighborhood and their solvent accessibilities were observed to influence the predictions. We also highlight how predictions are impacted by the underlying disease mechanisms of missense mutations and offer insights into understanding these to improve pathogenicity predictions. Our study presents aspects to take into consideration when adopting deep learning approaches for protein structure-guided pathogenicity predictions.

【 授权许可】

Unknown   
Copyright © 2023 Ramakrishnan, Baakman, Heijl, Vroling, van Horck, Hiraki, Xue and Huynen.

【 预 览 】
附件列表
Files Size Format View
RO202310100928233ZK.pdf 2717KB PDF download
  文献评价指标  
  下载次数:10次 浏览次数:0次