期刊论文详细信息
BMC Bioinformatics
Methodology and software to detect viral integration site hot-spots
Research Article
Angela P Presson1  Namshin Kim2  Yan Xiaofei3  Sanggu Kim4  Irvin SY Chen5 
[1] Department of Biostatistics, University of California Los Angeles, School of Public Health, 90095, Los Angeles, CA, USA;UCLA AIDS Institute, University of California Los Angeles, 90095, Los Angeles, CA, USA;Department of Microbiology, Immunology, & Molecular Genetics, University of California Los Angeles, 90095, Los Angeles, CA, USA;Korea Research Institute of Bioscience and Biotechnology, 111 Gwahangno, 305-806, Yuseong-gu, Daejeon, Korea;Medical Genetics Institute, Cedars-Sinai Medical Center, 90048, Los Angeles, CA, USA;UCLA AIDS Institute, University of California Los Angeles, 90095, Los Angeles, CA, USA;Department of Microbiology, Immunology, & Molecular Genetics, University of California Los Angeles, 90095, Los Angeles, CA, USA;UCLA AIDS Institute, University of California Los Angeles, 90095, Los Angeles, CA, USA;Department of Microbiology, Immunology, & Molecular Genetics, University of California Los Angeles, 90095, Los Angeles, CA, USA;Department of Medicine, University of California Los Angeles, 90095, Los Angeles, CA, USA;
关键词: Chronic Granulomatous Disease;    RefSeq Gene;    Original Partition;    Common Insertion Site;    Murine Leukemia Virus Vector;   
DOI  :  10.1186/1471-2105-12-367
 received in 2011-04-05, accepted in 2011-09-14,  发布年份 2011
来源: Springer
PDF
【 摘 要 】

BackgroundModern gene therapy methods have limited control over where a therapeutic viral vector inserts into the host genome. Vector integration can activate local gene expression, which can cause cancer if the vector inserts near an oncogene. Viral integration hot-spots or 'common insertion sites' (CIS) are scrutinized to evaluate and predict patient safety. CIS are typically defined by a minimum density of insertions (such as 2-4 within a 30-100 kb region), which unfortunately depends on the total number of observed VIS. This is problematic for comparing hot-spot distributions across data sets and patients, where the VIS numbers may vary.ResultsWe develop two new methods for defining hot-spots that are relatively independent of data set size. Both methods operate on distributions of VIS across consecutive 1 Mb 'bins' of the genome. The first method 'z-threshold' tallies the number of VIS per bin, converts these counts to z-scores, and applies a threshold to define high density bins. The second method 'BCP' applies a Bayesian change-point model to the z-scores to define hot-spots. The novel hot-spot methods are compared with a conventional CIS method using simulated data sets and data sets from five published human studies, including the X-linked ALD (adrenoleukodystrophy), CGD (chronic granulomatous disease) and SCID-X1 (X-linked severe combined immunodeficiency) trials. The BCP analysis of the human X-linked ALD data for two patients separately (774 and 1627 VIS) and combined (2401 VIS) resulted in 5-6 hot-spots covering 0.17-0.251% of the genome and containing 5.56-7.74% of the total VIS. In comparison, the CIS analysis resulted in 12-110 hot-spots covering 0.018-0.246% of the genome and containing 5.81-22.7% of the VIS, corresponding to a greater number of hot-spots as the data set size increased. Our hot-spot methods enable one to evaluate the extent of VIS clustering, and formally compare data sets in terms of hot-spot overlap. Finally, we show that the BCP hot-spots from the repopulating samples coincide with greater gene and CpG island density than the median genome density.ConclusionsThe z-threshold and BCP methods are useful for comparing hot-spot patterns across data sets of disparate sizes. The methodology and software provided here should enable one to study hot-spot conservation across a variety of VIS data sets and evaluate vector safety for gene therapy trials.

【 授权许可】

CC BY   
© Presson et al; licensee BioMed Central Ltd. 2011

【 预 览 】
附件列表
Files Size Format View
RO202311108631391ZK.pdf 1138KB PDF download
【 参考文献 】
  • [1]
  • [2]
  • [3]
  • [4]
  • [5]
  • [6]
  • [7]
  • [8]
  • [9]
  • [10]
  • [11]
  • [12]
  • [13]
  • [14]
  • [15]
  • [16]
  • [17]
  • [18]
  • [19]
  • [20]
  • [21]
  • [22]
  • [23]
  • [24]
  • [25]
  • [26]
  • [27]
  • [28]
  • [29]
  • [30]
  • [31]
  • [32]
  文献评价指标  
  下载次数:3次 浏览次数:0次