| BMC Genomics | |
| Short DNA sequence patterns accurately identify broadly active human enhancers | |
| Research Article | |
| Ling Chen1  Laura L. Colbran2  John A. Capra3  | |
| [1] Department of Biological Sciences, Vanderbilt University, 37235, Nashville, TN, USA;Vanderbilt Genetics Institute, Vanderbilt University, 37235, Nashville, TN, USA;Vanderbilt Genetics Institute, Vanderbilt University, 37235, Nashville, TN, USA;Department of Biological Sciences, Vanderbilt University, 37235, Nashville, TN, USA;Center for Structural Biology, Departments of Biomedical Informatics and Computer Science, Vanderbilt University, 37235, Nashville, TN, USA; | |
| 关键词: Enhancer; Dinucleotide repeat motif; Machine learning; Gene expression; | |
| DOI : 10.1186/s12864-017-3934-9 | |
| received in 2016-12-05, accepted in 2017-07-09, 发布年份 2017 | |
| 来源: Springer | |
PDF
|
|
【 摘 要 】
BackgroundEnhancers are DNA regulatory elements that influence gene expression. There is substantial diversity in enhancers’ activity patterns: some enhancers drive expression in a single cellular context, while others are active across many. Sequence characteristics, such as transcription factor (TF) binding motifs, influence the activity patterns of regulatory sequences; however, the regulatory logic through which specific sequences drive enhancer activity patterns is poorly understood. Recent analysis of Drosophila enhancers suggested that short dinucleotide repeat motifs (DRMs) are general enhancer sequence features that drive broad regulatory activity. However, it is not known whether the regulatory role of DRMs is conserved across species.ResultsWe performed a comprehensive analysis of the relationship between short DNA sequence patterns, including DRMs, and human enhancer activity in 38,538 enhancers across 411 different contexts. In a machine-learning framework, the occurrence patterns of short sequence motifs accurately predicted broadly active human enhancers. However, DRMs alone were weakly predictive of broad enhancer activity in humans and showed different enrichment patterns than in Drosophila. In general, GC-rich sequence motifs were significantly associated with broad enhancer activity, and consistent with this enrichment, broadly active human TFs recognize GC-rich motifs.ConclusionsOur results reveal the importance of specific sequence motifs in broadly active human enhancers, demonstrate the lack of evolutionary conservation of the role of DRMs, and provide a computational framework for investigating the logic of enhancer sequences.
【 授权许可】
CC BY
© The Author(s). 2017
【 预 览 】
| Files | Size | Format | View |
|---|---|---|---|
| RO202311090174294ZK.pdf | 1466KB |
【 参考文献 】
- [1]
- [2]
- [3]
- [4]
- [5]
- [6]
- [7]
- [8]
- [9]
- [10]
- [11]
- [12]
- [13]
- [14]
- [15]
- [16]
- [17]
- [18]
- [19]
- [20]
- [21]
- [22]
- [23]
- [24]
- [25]
- [26]
- [27]
- [28]
- [29]
- [30]
- [31]
- [32]
- [33]
- [34]
- [35]
- [36]
- [37]
- [38]
- [39]
- [40]
- [41]
- [42]
- [43]
- [44]
- [45]
- [46]
- [47]
- [48]
PDF