期刊论文详细信息
BMC Bioinformatics
Zipper plot: visualizing transcriptional activity of genomic regions
Methodology Article
Celine Everaert1  Jasper Anckaert1  Pieter-Jan Volders1  Dries Rombaut1  Francisco Avila Cobos1  Pieter Mestdagh1  Jo Vandesompele1  Katleen De Preter1 
[1] Center for Medical Genetics, Ghent University, De Pintelaan 185, Ghent, Belgium;Cancer Research Institute Ghent, De Pintelaan 185, Ghent, Belgium;Bioinformatics Institute Ghent from Nucleotides to Networks, De Pintelaan 185, Ghent, Belgium;
关键词: Transcription Start Site;    Genomic Feature;    Transcript Model;    lncRNA Transcript;    Close Peak;   
DOI  :  10.1186/s12859-017-1651-7
 received in 2017-02-04, accepted in 2017-04-25,  发布年份 2017
来源: Springer
PDF
【 摘 要 】

BackgroundReconstructing transcript models from RNA-sequencing (RNA-seq) data and establishing these as independent transcriptional units can be a challenging task. Current state-of-the-art tools for long non-coding RNA (lncRNA) annotation are mainly based on evolutionary constraints, which may result in false negatives due to the overall limited conservation of lncRNAs.ResultsTo tackle this problem we have developed the Zipper plot, a novel visualization and analysis method that enables users to simultaneously interrogate thousands of human putative transcription start sites (TSSs) in relation to various features that are indicative for transcriptional activity. These include publicly available CAGE-sequencing, ChIP-sequencing and DNase-sequencing datasets. Our method only requires three tab-separated fields (chromosome, genomic coordinate of the TSS and strand) as input and generates a report that includes a detailed summary table, a Zipper plot and several statistics derived from this plot.ConclusionUsing the Zipper plot, we found evidence of transcription for a set of well-characterized lncRNAs and observed that fewer mono-exonic lncRNAs have CAGE peaks overlapping with their TSSs compared to multi-exonic lncRNAs. Using publicly available RNA-seq data, we found more than one hundred cases where junction reads connected protein-coding gene exons with a downstream mono-exonic lncRNA, revealing the need for a careful evaluation of lncRNA 5′-boundaries. Our method is implemented using the statistical programming language R and is freely available as a webtool.

【 授权许可】

CC BY   
© The Author(s). 2017

【 预 览 】
附件列表
Files Size Format View
RO202311101000128ZK.pdf 1030KB PDF download
【 参考文献 】
  • [1]
  • [2]
  • [3]
  • [4]
  • [5]
  • [6]
  • [7]
  • [8]
  • [9]
  • [10]
  • [11]
  • [12]
  • [13]
  • [14]
  • [15]
  • [16]
  • [17]
  • [18]
  • [19]
  • [20]
  • [21]
  • [22]
  • [23]
  • [24]
  • [25]
  • [26]
  • [27]
  • [28]
  • [29]
  • [30]
  • [31]
  • [32]
  • [33]
  • [34]
  • [35]
  • [36]
  • [37]
  • [38]
  • [39]
  • [40]
  • [41]
  • [42]
  文献评价指标  
  下载次数:0次 浏览次数:0次