期刊论文详细信息
Frontiers in Genetics
Interpretable Machine Learning Reveals Dissimilarities Between Subtypes of Autism Spectrum Disorder
Karolina Smolinska2  Mateusz Garbulowski2  Lars Feuk3  Klev Diamanti3  Khurram Maqbool3  Gang Pan3  Jan Komorowski5 
[1] Institute of Computer Science, Polish Academy of Sciences, Warsaw, Poland;Science for Life Laboratory, Department of Cell and Molecular Biology, Uppsala University, Uppsala, Sweden;Science for Life Laboratory, Department of Immunology, Genetics and Pathology, Uppsala University, Uppsala, Sweden;Swedish Collegium for Advanced Study, Uppsala, Sweden;Washington National Primate Research Center, Seattle, WA, United States;
关键词: autism spectrum disorder;    interpretable machine learning;    transcriptomics;    rule-based classification;    autism spectrum disorder subtypes;    data integration;   
DOI  :  10.3389/fgene.2021.618277
来源: DOAJ
【 摘 要 】

Autism spectrum disorder (ASD) is a heterogeneous neuropsychiatric disorder with a complex genetic background. Analysis of altered molecular processes in ASD patients requires linear and nonlinear methods that provide interpretable solutions. Interpretable machine learning provides legible models that allow explaining biological mechanisms and support analysis of clinical subgroups. In this work, we investigated several case-control studies of gene expression measurements of ASD individuals. We constructed a rule-based learning model from three independent datasets that we further visualized as a nonlinear gene-gene co-predictive network. To find dissimilarities between ASD subtypes, we scrutinized a topological structure of the network and estimated a centrality distance. Our analysis revealed that autism is the most severe subtype of ASD, while pervasive developmental disorder-not otherwise specified and Asperger syndrome are closely related and milder ASD subtypes. Furthermore, we analyzed the most important ASD-related features that were described in terms of gene co-predictors. Among others, we found a strong co-predictive mechanism between EMC4 and TMEM30A, which may suggest a co-regulation between these genes. The present study demonstrates the potential of applying interpretable machine learning in bioinformatics analyses. Although the proposed methodology was designed for transcriptomics data, it can be applied to other omics disciplines.

【 授权许可】

Unknown   

  文献评价指标  
  下载次数:0次 浏览次数:0次