Thesis details
Towards the integration of genomic profiles and gene interaction networks for machine learning
Lin, Henry A ; Han, Jiawei
Keywords: Drug response; Random Walk with Restart; Network Propagation; PageRank; Embedding
Others  :  https://www.ideals.illinois.edu/bitstream/handle/2142/90839/LIN-THESIS-2016.pdf?sequence=1&isAllowed=y
United States | English
Source: The Illinois Digital Environment for Access to Learning and Scholarship
【 Abstract 】

With the advent of big data, scientists are collecting biological data faster than ever before, including genomic profiles, which describe individuals by thousands of genes at a time. Adding to this library of knowledge are gene interaction networks, which model overarching cellular processes by describing how genes interact with each other. Given genomic profile data together with gene interaction data, the question becomes how to integrate these two sources of knowledge for machine learning. Previous studies have employed some form of feature engineering to "collapse" the network topology alongside the genomic profiles, losing the potential for global network information. Instead, we explore a framework based upon network propagation. We explain how network propagation algorithms can enhance standalone genomic profiles, yielding enhanced profiles called embeddings, and show that these enhancements lead to improved predictive accuracy on drug response classification. We next show that these embeddings contain predictive signals that are not necessarily implicated by gene ranking methods such as PageRank. Last, we apply network propagation to a dataset presented by the DREAM organization, and show that it improves on a naive linear regression for a drug sensitivity ranking task.
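
The abstract does not give the propagation equations, so the following is only a minimal sketch of one common form of Random Walk with Restart (one of the listed keywords), not the thesis's actual implementation. The function name `random_walk_with_restart`, the `restart` probability of 0.5, the column normalization, and the toy three-gene network are all assumptions made for illustration. Each genomic profile is used as the restart distribution and diffused over the gene network until it stabilizes; the result plays the role of an "embedding".

```python
import numpy as np


def random_walk_with_restart(adjacency, profiles, restart=0.5,
                             tol=1e-8, max_iter=1000):
    """Smooth genomic profiles over a gene network (illustrative sketch).

    adjacency: (n_genes, n_genes) non-negative gene interaction matrix.
    profiles:  (n_samples, n_genes) genomic profiles, one row per sample.
    restart:   probability of jumping back to the original profile
               (hypothetical default, not taken from the thesis).
    Returns an array of shape (n_samples, n_genes).
    """
    adjacency = np.asarray(adjacency, dtype=float)
    profiles = np.asarray(profiles, dtype=float)

    # Column-normalize so every column of the transition matrix sums to 1.
    col_sums = adjacency.sum(axis=0)
    col_sums[col_sums == 0] = 1.0  # avoid dividing by zero for isolated genes
    transition = adjacency / col_sums

    start = profiles.T          # shape (n_genes, n_samples)
    state = start.copy()
    for _ in range(max_iter):
        # Walk one step along the network, then mix the original profile back in.
        updated = (1.0 - restart) * (transition @ state) + restart * start
        converged = np.abs(updated - state).max() < tol
        state = updated
        if converged:
            break
    return state.T


# Toy example: a path network gene0 - gene1 - gene2, one sample with a
# signal on gene0 only. After propagation the signal spreads to the
# neighbours, with more weight on the closer gene.
adjacency = np.array([[0, 1, 0],
                      [1, 0, 1],
                      [0, 1, 0]])
profile = np.array([[1.0, 0.0, 0.0]])
embedding = random_walk_with_restart(adjacency, profile)
```

In a pipeline like the one the abstract describes, the propagated rows would replace the raw profiles as input features to a classifier or regression model; that downstream step is not shown here.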

【 Preview 】
Attachments
Files | Size | Format | View
Towards the integration of genomic profiles and gene interaction networks for machine learning | 1035 KB | PDF | download
Document metrics
Downloads: 17    Views: 37