学位论文详细信息
Extraction and identification of frequent sequential patterns in transcription factor binding site organization of enhancers
Frequent Sequential Pattern Mining (FSPM);Data Mining;Bioinformatics;Enhancer;Transcription Factor (TF);Transcription Factor Binding Site (TFBS);Transcription;Regulation;Genomics
Bissonnette, Paul ; Ma ; Jian
关键词: Frequent Sequential Pattern Mining (FSPM);    Data Mining;    Bioinformatics;    Enhancer;    Transcription Factor (TF);    Transcription Factor Binding Site (TFBS);    Transcription;    Regulation;    Genomics;   
Others  :  https://www.ideals.illinois.edu/bitstream/handle/2142/49794/Paul_Bissonnette.pdf?sequence=1&isAllowed=y
美国|英语
来源: The Illinois Digital Environment for Access to Learning and Scholarship
PDF
【 摘 要 】

The advent of laboratory techniques to assess protein-DNA interactions, and chromatin post-translational modifications has created vast arrays of data correlating binding sites and regulatory elements with genome regions. There has been great interest in developing new computational approaches that leverage these annotations in order to better understand the layout and organization of eukaryotic genomes and gene regulation. In particular, while tools exist to accurately map the coding regions of the genome, few methods exist to leverage new annotations in the non-coding regions of the genome that may hold the key to many important questions about gene regulation. By exploiting the results of large scale annotation studies such as ENCODE to identify cell-type specific enhancers, and transcription factor binding sites, it is possible to better understand enhancer regions of the genome responsible for promoting transcriptional activity. Using frequent sequential pattern mining techniques from the classical data mining field, the importance of linear order in enhancer role and structure was explored in this thesis. Common orderings of binding sites within enhancers were identified across nine cell lines. Putative targets of enhancers exhibiting these patterns were then determined and clustered based on functional classification. Examination of detected patterns indicated that while the choice of transcription factors in the pattern often correlated with the overall function of putative targets, the ordering of these binding sites might be an effective classifier of more specific functional activity. Additionally, findings suggested that the arrangement of binding sites within enhancers was more likely to be cell specific than the transcription factor binding sites themselves. The knowledge that binding site pattern is more strongly linked to target function than binding site co-association alone could lead to important advances in our understanding of the diverse regulatory roles of different enhancers as well as new functional annotations for enhancers, genes, and transcription factors that have not yet been the focus of intensive studies to elucidate their roles.

【 预 览 】
附件列表
Files Size Format View
Extraction and identification of frequent sequential patterns in transcription factor binding site organization of enhancers 1404KB PDF download
  文献评价指标  
  下载次数:9次 浏览次数:14次