学位论文详细信息
Bioinformatics analyses of non-coding genomic elements
transcription factors;hox genes;endogenous retroviruses;koala;sumatran rhinoceros;Phascolarctos cinereus;koala retrovirus;korv
Zhao, Kai
关键词: transcription factors;    hox genes;    endogenous retroviruses;    koala;    sumatran rhinoceros;    Phascolarctos cinereus;    koala retrovirus;    korv;   
Others  :  https://www.ideals.illinois.edu/bitstream/handle/2142/90895/ZHAO-DISSERTATION-2016.pdf?sequence=1&isAllowed=y
美国|英语
来源: The Illinois Digital Environment for Access to Learning and Scholarship
PDF
【 摘 要 】

Mammalian genomes consist primarily of non-coding sequences (Kellis et al. 2014). Originally castigated as "junk DNA", many non-coding regions have now been characterized as having functional roles, or have been determined to be the causal agent for diseases. Additionally, sequences that are non-functional can be used as neutral markers for population genetics. Determining the role of non-coding sequences or finding sequences usable as neutral markers is computationally and biologically non-trivial. However, recent advances in molecular biology, in particular the reduced cost of next-generation sequencing (NGS), have enabled new experiments that involve these sequences. I will discuss studies using bioinformatics that leveraged these advances to characterize three types of non-coding sequences: endogenous retroviruses, microsatellite markers and transcription factor binding sites. I conducted the bioinformatics design, coding and analyses, working with collaborators who verified findings in the laboratory. The only retrovirus known to be currently transitioning from exogenous to endogenous form is the koala retrovirus (KoRV), making koalas (Phascolarctos cinereus) ideal for examining the early stages of retroviral endogenization. In the first study, I developed a bioinformatics routine to identify distinct retroviral integrants from NGS reads of KoRV retrovirus flanks isolated using koala genomic DNA. In the second study, I developed computationally efficient, user-friendly software that would identify polymorphic microsatellite loci using NGS reads, then design oligonucleotide primers appropriate for amplifying those loci. We developed this software to enable studies to improve understanding of population structure, estimate population size and estimate genetic diversity in genetically depauperate wildlife species. In the third study, I developed a bioinformatics pipeline to characterize gene expression changes during development in the fetal limb tissue of several mammalian species, to better understand the mechanistic differences across evolutionary lineages. We compared development in four species of mammals. The house mouse was used since it is a well-characterized model organism with five digits. The domestic pig was used since it is a well-studied agricultural animal and a model for digit reduction. A species of bat was used since bats undergo wing development. Finally, a species of opossum was used as an outgroup to the three eutherian species.

【 预 览 】
附件列表
Files Size Format View
Bioinformatics analyses of non-coding genomic elements 2194KB PDF download
  文献评价指标  
  下载次数:14次 浏览次数:22次