期刊论文详细信息
BMC Bioinformatics
Error correction and statistical analyses for intra-host comparisons of feline immunodeficiency virus diversity from high-throughput sequencing data
Yang Liu4  Francesca Chiaromonte4  Howard Ross2  Raunaq Malhotra3  Daniel Elleder1  Mary Poss4 
[1] Current address: Institute of Molecular Genetics, Academy of Sciences of the Czech Republic, Videnska 1083, Prague 14000, Czech Republic
[2] Bioinformatics Institute, School of Biological Sciences, University of Auckland, Auckland 1142, New Zealand
[3] Department of Computer Science and Engineering, The Pennsylvania State University, University Park 16802, PA, USA
[4] The Huck Institutes of the Life Sciences, The Pennsylvania State University, University Park 16802, PA, USA
关键词: Viral Coinfection;    Linear Mixed Model;    Error Correction;    FIV;    Next Generation Sequencing;    Virus Population Dynamics;   
Others  :  1231823
DOI  :  10.1186/s12859-015-0607-z
 received in 2014-03-06, accepted in 2015-04-29,  发布年份 2015
【 摘 要 】

Background

Infection with feline immunodeficiency virus (FIV) causes an immunosuppressive disease whose consequences are less severe if cats are co-infected with an attenuated FIV strain (PLV). We use virus diversity measurements, which reflect replication ability and the virus response to various conditions, to test whether diversity of virulent FIV in lymphoid tissues is altered in the presence of PLV. Our data consisted of the 3′ half of the FIV genome from three tissues of animals infected with FIV alone, or with FIV and PLV, sequenced by 454 technology.

Results

Since rare variants dominate virus populations, we had to carefully distinguish sequence variation from errors due to experimental protocols and sequencing. We considered an exponential-normal convolution model used for background correction of microarray data, and modified it to formulate an error correction approach for minor allele frequencies derived from high-throughput sequencing. Similar to accounting for over-dispersion in counts, this accounts for error-inflated variability in frequencies – and quite effectively reproduces empirically observed distributions. After obtaining error-corrected minor allele frequencies, we applied ANalysis Of VAriance (ANOVA) based on a linear mixed model and found that conserved sites and transition frequencies in FIV genes differ among tissues of dual and single infected cats. Furthermore, analysis of minor allele frequencies at individual FIV genome sites revealed 242 sites significantly affected by infection status (dual vs. single) or infection status by tissue interaction. All together, our results demonstrated a decrease in FIV diversity in bone marrow in the presence of PLV. Importantly, these effects were weakened or undetectable when error correction was performed with other approaches (thresholding of minor allele frequencies; probabilistic clustering of reads). We also queried the data for cytidine deaminase activity on the viral genome, which causes an asymmetric increase in G to A substitutions, but found no evidence for this host defense strategy.

Conclusions

Our error correction approach for minor allele frequencies (more sensitive and computationally efficient than other algorithms) and our statistical treatment of variation (ANOVA) were critical for effective use of high-throughput sequencing data in understanding viral diversity. We found that co-infection with PLV shifts FIV diversity from bone marrow to lymph node and spleen.

【 授权许可】

   
2015 Liu et al.

【 参考文献 】
  • [1]Poss M, Rodrigo AG, Gosink JJ, Learn GH, de Vange PD, Martin HL et al.. Evolution of envelope sequences from the genital tract and peripheral blood of women infected with clade A human immunodeficiency virus type 1. J Virol. 1998; 72:8240-51.
  • [2]Nickle DC, Jensen MA, Shriner D, Brodie SJ, Frenkel LM, Mittler JE et al.. Evolutionary indicators of human immunodeficiency virus type 1 reservoirs and compartments. J Virol. 2003; 77:5540-6.
  • [3]Salemi M, Burkhardt BR, Gray RR, Ghaffari G, Sleasman JW, Goodenow MM. Phylodynamics of HIV-1 in lymphoid and non-lymphoid tissues reveals a central role for the thymus in emergence of CXCR4-using quasispecies. PLoS One. 2007; 2:e950.
  • [4]Blackard JT. HIV compartmentalization: a review on a clinically important phenomenon. Curr HIV Res. 2012; 10:133-42.
  • [5]Burkhard MJ, Dean GA. Transmission and immunopathogenesis of FIV in cats as a model for HIV. Curr HIV Res. 2003; 1:15-29.
  • [6]VandeWoude S, Apetrei C. Going wild: lessons from naturally occurring T-lymphotropic lentiviruses. Clin Microbiol Rev. 2006; 19:728-62.
  • [7]Elder JH, Lin YC, Fink E, Grant CK. Feline immunodeficiency virus (FIV) as a model for study of lentivirus infections: parallels with HIV. Curr HIV Res. 2010; 8:73-80.
  • [8]Bendinelli M, Pistello M, Lombardi S, Poli A, Garzelli C, Matteucci D et al.. Feline immunodeficiency virus: an interesting model for AIDS studies and an important cat pathogen. Clin Microbiol Rev. 1995; 8:87-112.
  • [9]Terwee JA, Yactor JK, Sondgeroth KS, Vandewoude S. Puma lentivirus is controlled in domestic cats after mucosal exposure in the absence of conventional indicators of immunity. J Virol. 2005; 79:2797-806.
  • [10]VandeWoude S, Hageman CA, O’Brien SJ, Hoover EA. Nonpathogenic lion and puma lentiviruses impart resistance to superinfection by virulent feline immunodeficiency virus. J Acquir Immune Defic Syndr. 2002; 29:1-10.
  • [11]Terwee JA, Carlson JK, Sprague WS, Sondgeroth KS, Shropshire SB, Troyer JL et al.. Prevention of immunodeficiency virus induced CD4+ T-cell depletion by prior infection with a non-pathogenic virus. Virology. 2008; 377:63-70.
  • [12]Zheng X, Carver S, Troyer RM, Terwee JA, VandeWoude S. Prior virus exposure alters the long-term landscape of viral replication during feline lentiviral infection. Viruses. 2011; 3:1891-908.
  • [13]Padhi A, Ross H, Terwee J, Vandewoude S, Poss M. Profound differences in virus population genetics correspond to protection from CD4 decline resulting from feline lentivirus coinfection. Viruses. 2010; 2:2663-80.
  • [14]Hoffmann C, Minkah N, Leipzig J, Wang G, Arens MQ, Tebas P et al.. DNA bar coding and pyrosequencing to identify rare HIV drug resistance mutations. Nucleic Acids Res. 2007; 35:e91.
  • [15]Barzon L, Lavezzo E, Militello V, Toppo S, Palu G. Applications of next-generation sequencing technologies to diagnostic virology. Int J Mol Sci. 2011; 12:7861-84.
  • [16]Radford AD, Chapman D, Dixon L, Chantrey J, Darby AC, Hall N. Application of next-generation sequencing technologies in virology. J Gen Virol. 2012; 93:1853-68.
  • [17]Eriksson N, Pachter L, Mitsuya Y, Rhee SY, Wang C, Gharizadeh B et al.. Viral population estimation using pyrosequencing. PLoS Comput Biol. 2008; 4:e1000074.
  • [18]Willerth SM, Pedro HA, Pachter L, Humeau LM, Arkin AP, Schaffer DV. Development of a low bias method for characterizing viral populations using next generation sequencing technology. PLoS One. 2010; 5:e13564.
  • [19]Wright CF, Morelli MJ, Thebaud G, Knowles NJ, Herzyk P, Paton DJ et al.. Beyond the consensus: dissecting within-host viral population diversity of foot-and-mouth disease virus by using next-generation genome sequencing. J Virol. 2011; 85:2266-75.
  • [20]Henn MR, Boutwell CL, Charlebois P, Lennon NJ, Power KA, Macalalad AR et al.. Whole genome deep sequencing of HIV-1 reveals the impact of early minor variants upon immune recognition during acute infection. PLoS Pathog. 2012; 8:e1002529.
  • [21]Zagordi O, Bhattacharya A, Eriksson N, Beerenwinkel N. ShoRAH: estimating the genetic diversity of a mixed sample from next-generation sequencing data. BMC bioinformatics. 2011; 12:119. BioMed Central Full Text
  • [22]Prabhakara S, Malhotra R, Acharya R, Poss M. Mutant-Bin: Unsupervised Haplotype Estimation of Viral Population Diversity Without Reference Genome. J Comput Biol. 2013; 20:453-63.
  • [23]Shendure J, Ji H. Next-generation DNA sequencing. Nat Biotechnol. 2008; 26:1135-45.
  • [24]Romano CM, Lauck M, Salvador FS, Lima CR, Villas-Boas LS, Araujo ES et al.. Inter- and intra-host viral diversity in a large seasonal DENV2 outbreak. PLoS One. 2013; 8:e70318.
  • [25]Wang C, Mitsuya Y, Gharizadeh B, Ronaghi M, Shafer RW. Characterization of mutation spectra with ultra-deep pyrosequencing: application to HIV-1 drug resistance. Genome Res. 2007; 17:1195-201.
  • [26]Morelli MJ, Wright CF, Knowles NJ, Juleff N, Paton DJ, King DP et al.. Evolution of foot-and-mouth disease virus intra-sample sequence diversity during serial transmission in bovine hosts. Vet Res. 2013; 44:12. BioMed Central Full Text
  • [27]Anders S, Huber W. Differential expression analysis for sequence count data. Genome Biol. 2010; 11:R106. BioMed Central Full Text
  • [28]Hashimoto TB, Edwards MD, Gifford DK. Universal count correction for high-throughput sequencing. PLoS Comput Biol. 2014; 10:e1003494.
  • [29]Robinson MD, McCarthy DJ. Smyth GK: edgeR: a Bioconductor package for differential expression analysis of digital gene expression data. Bioinformatics. 2010; 26:139-40.
  • [30]Zagordi O, Geyrhofer L, Roth V, Beerenwinkel N. Deep sequencing of a genetically heterogeneous sample: local haplotype reconstruction and read error correction. J Comput Biol. 2010; 17:417-28.
  • [31]Skums P, Dimitrova Z, Campo DS, Vaughan G, Rossi L, Forbi JC et al.. Efficient error correction for next-generation sequencing of viral amplicons. BMC bioinformatics. 2012; 13 Suppl(10):S6. BioMed Central Full Text
  • [32]Beerenwinkel N, Gunthard HF, Roth V, Metzner KJ. Challenges and opportunities in estimating viral genetic diversity from next-generation sequencing data. Front Microbiol. 2012; 3:329.
  • [33]Bolstad BM. Low Level Analysis of High-density Oligonucleotide Array Data: Background, Normalization and Summarization. Dissertation. University of California, Berkeley, Department of Statistics. 2004.
  • [34]McGee M, Chen Z. Parameter estimation for the exponential-normal convolution model for background correction of affymetrix GeneChip data. Statistical applications in genetics and molecular biology. 2006; 5:Article24.
  • [35]Irizarry RA, Hobbs B, Collin F, Beazer-Barclay YD, Antonellis KJ, Scherf U et al.. Exploration, normalization, and summaries of high density oligonucleotide array probe level data. Biostatistics. 2003; 4:249-64.
  • [36]Irizarry RA, Bolstad BM, Collin F, Cope LM, Hobbs B, Speed TP. Summaries of Affymetrix GeneChip probe level data. Nucleic Acids Res. 2003; 31:e15.
  • [37]Bolstad BM, Irizarry RA, Astrand M, Speed TP. A comparison of normalization methods for high density oligonucleotide array data based on variance and bias. Bioinformatics. 2003; 19:185-93.
  • [38]Niklas N, Proll J, Danzer M, Stabentheiner S, Hofer K, Gabriel C. Routine performance and errors of 454 HLA exon sequencing in diagnostics. BMC bioinformatics. 2013; 14:176. BioMed Central Full Text
  • [39]Poss M, Ross HA, Painter SL, Holley DC, Terwee JA, Vandewoude S et al.. Feline lentivirus evolution in cross-species infection reveals extensive G-to-A mutation and selection on key residues in the viral polymerase. J Virol. 2006; 80:2728-37.
  • [40]Bowker AH. A test for symmetry in contingency tables. J Am Stat Assoc. 1948; 43:572-4.
  • [41]Stuart A. A test for homogeneity of the marginal distributions in a two-way classification. Biometrika. 1955; 42:412-6.
  • [42]Ababneh F, Jermiin LS, Ma C, Robinson J. Matched-pairs tests of homogeneity with applications to homologous nucleotide sequences. Bioinformatics. 2006; 22:1225-31.
  • [43]Hayward JJ, Rodrigo AG. Molecular epidemiology of feline immunodeficiency virus in the domestic cat (Felis catus). Vet Immunol Immunopathol. 2010; 134:68-74.
  • [44]Chiu Y-L, Greene WC. The APOBEC3 cytidine deaminases: an innate defensive network opposing exogenous retroviruses and endogenous retroelements. Annu Rev Immunol. 2008; 26:317-53.
  • [45]Marin M, Rose KM, Kozak SL, Kabat D. HIV-1 Vif protein binds the editing enzyme APOBEC3G and induces its degradation. Nat Med. 2003; 9:1398-403.
  • [46]Conticello SG, Harris RS, Neuberger MS. The Vif protein of HIV triggers degradation of the human antiretroviral DNA deaminase APOBEC3G. Curr Biol. 2003; 13:2009-13.
  • [47]CLC Genomics Workbench. http://www. clcbio.com/products/clc-genomics-workbench/ webcite
  • [48]Bioconductor: Open Source Software for Bioinformatics. http://www. bioconductor.org/ webcite
  • [49]The R Project for Statistical Computing. http://www. r-project.org/ webcite
  • [50]SAS. http://www. sas.com/ webcite
  文献评价指标  
  下载次数:0次 浏览次数:0次