期刊论文详细信息
BMC Bioinformatics
Impact of data resolution on three-dimensional structure inference methods
Research Article
Jincheol Park1  Shili Lin2 
[1] Department of Statistics, Keimyung University, 1095 Dalgubeol-daero, Dalseo-gu, Daegu 42601, South Korea;Department of Statistics, The Ohio State University, 1958 Neil Avenue, Columbus, OH, USA;
关键词: Data resolution;    Excess of zeros;    Correlated read counts;   
DOI  :  10.1186/s12859-016-0894-z
 received in 2016-01-13, accepted in 2016-01-16,  发布年份 2016
来源: Springer
PDF
【 摘 要 】

BackgroundAssays that are capable of detecting genome-wide chromatin interactions have produced massive amount of data and led to great understanding of the chromosomal three-dimensional (3D) structure. As technology becomes more sophisticated, higher-and-higher resolution data are being produced, going from the initial 1 Megabases (Mb) resolution to the current 10 Kilobases (Kb) or even 1 Kb resolution. The availability of genome-wide interaction data necessitates development of analytical methods to recover the underlying 3D spatial chromatin structure, but challenges abound. Most of the methods were proposed for analyzing data at low resolution (1 Mb). Their behaviors are thus unknown for higher resolution data. For such data, one of the key features is the high proportion of “0” contact counts among all available data, in other words, the excess of zeros.ResultsTo address the issue of excess of zeros, in this paper, we propose a truncated Random effect EXpression (tREX) method that can handle data at various resolutions. We then assess the performance of tREX and a number of leading existing methods for recovering the underlying chromatin 3D structure. This was accomplished by creating in-silico data to mimic multiple levels of resolution and submit the methods to a “stress test”. Finally, we applied tREX and the comparison methods to a Hi-C dataset for which FISH measurements are available to evaluate estimation accuracy.ConclusionThe proposed tREX method achieves consistently good performance in all 30 simulated settings considered. It is not only robust to resolution level and underlying parameters, but also insensitive to model misspecification. This conclusion is based on observations made in terms of 3D structure estimation accuracy and preservation of topologically associated domains. Application of the methods to the human lymphoblastoid cell line data on chromosomes 14 and 22 further substantiates the superior performance of tREX: the constructed 3D structure from tREX is consistent with the FISH measurements, and the corresponding distances predicted by tREX have higher correlation with the FISH measurements than any of the comparison methods.SoftwareAn open-source R-package is available at http://www.stat.osu.edu/~statgen/Software/tRex.

【 授权许可】

CC BY   
© Park and Lin. 2016

【 预 览 】
附件列表
Files Size Format View
RO202311101964030ZK.pdf 860KB PDF download
Fig. 4 62KB Image download
MediaObjects/13068_2023_2399_MOESM4_ESM.xlsx 12KB Other download
Fig. 5 91KB Image download
Fig. 3 254KB Image download
Fig. 6 90KB Image download
12951_2017_292_Article_IEq1.gif 1KB Image download
12951_2015_155_Article_IEq62.gif 1KB Image download
MediaObjects/13046_2022_2359_MOESM2_ESM.docx 15KB Other download
Fig. 2 1305KB Image download
Fig. 1 1997KB Image download
12951_2017_255_Article_IEq39.gif 1KB Image download
Fig. 1 690KB Image download
Fig. 2 661KB Image download
Fig. 6 118KB Image download
【 图 表 】

Fig. 6

Fig. 2

Fig. 1

12951_2017_255_Article_IEq39.gif

Fig. 1

Fig. 2

12951_2015_155_Article_IEq62.gif

12951_2017_292_Article_IEq1.gif

Fig. 6

Fig. 3

Fig. 5

Fig. 4

【 参考文献 】
  • [1]
  • [2]
  • [3]
  • [4]
  • [5]
  • [6]
  • [7]
  • [8]
  • [9]
  • [10]
  • [11]
  • [12]
  • [13]
  • [14]
  • [15]
  • [16]
  • [17]
  • [18]
  • [19]
  文献评价指标  
  下载次数:9次 浏览次数:0次