期刊论文详细信息
BMC Evolutionary Biology
Uninformative polymorphisms bias genome scans for signatures of selection
Daniel Berner1  Walter Salzburger1  Marius Roesti1 
[1] Zoological Institute, University of Basel, Vesalgasse 1, Basel, CH-4051, Switzerland
关键词: Singleton;    Population differentiation;    Hitchhiking;    Genetic marker;    Gasterosteus aculeatus;    FST;    Allele frequency distribution;   
Others  :  1141019
DOI  :  10.1186/1471-2148-12-94
 received in 2012-01-24, accepted in 2012-06-22,  发布年份 2012
PDF
【 摘 要 】

Background

With the establishment of high-throughput sequencing technologies and new methods for rapid and extensive single nucleotide (SNP) discovery, marker-based genome scans in search of signatures of divergent selection between populations occupying ecologically distinct environments are becoming increasingly popular.

Methods and Results

On the basis of genome-wide SNP marker data generated by RAD sequencing of lake and stream stickleback populations, we show that the outcome of such studies can be systematically biased if markers with a low minor allele frequency are included in the analysis. The reason is that these ‘uninformative’ polymorphisms lack the adequate potential to capture signatures of drift and hitchhiking, the focal processes in ecological genome scans. Bias associated with uninformative polymorphisms is not eliminated by just avoiding technical artifacts in the data (PCR and sequencing errors), as a high proportion of SNPs with a low minor allele frequency is a general biological feature of natural populations.

Conclusions

We suggest that uninformative markers should be excluded from genome scans based on empirical criteria derived from careful inspection of the data, and that these criteria should be reported explicitly. Together, this should increase the quality and comparability of genome scans, and hence promote our understanding of the processes driving genomic differentiation.

【 授权许可】

   
2012 Roesti et al.; licensee BioMed Central Ltd.

【 预 览 】
附件列表
Files Size Format View
20150428094157289.pdf 478KB PDF download
Figure 3. 16KB Image download
Figure 2. 19KB Image download
Figure 1. 112KB Image download
【 图 表 】

Figure 1.

Figure 2.

Figure 3.

【 参考文献 】
  • [1]Wu CI: The genic view of the process of speciation. J Evol Biol 2001, 14:851-865.
  • [2]Mitchell-Olds T, Willis JH, Goldstein DB: Which evolutionary processes influence natural genetic variation for phenotypic traits? Nat Rev Genet 2007, 8:845-856.
  • [3]Nosil P, Funk DJ, Ortiz-Barrientos D: Divergent selection and heterogeneous genomic divergence. Mol Ecol 2009, 18:375-402.
  • [4]Schluter D: Evidence for ecological speciation and its alternative. Science 2009, 323:737-741.
  • [5]Holsinger KE, Weir BS: Genetics in geographically structured populations: defining, estimating and interpreting FST. Nat Rev Genet 2009, 10:639-650.
  • [6]Lewontin RC, Krakauer J: Distribution of gene frequency as a test of theory of selective neutrality of polymorphisms. Genetics 1973, 74:175-195.
  • [7]Maynard Smith J, Haigh J: Hitch-hiking effect of a favorable gene. Genet Res 1974, 23:23-35.
  • [8]Beaumont MA: Adaptation and speciation: what can Fst tell us? Trends Ecol Evol 2005, 20:435-440.
  • [9]Storz JF: Using genome scans of DNA polymorphism to infer adaptive population divergence. Mol Ecol 2005, 14:671-688.
  • [10]Stinchcombe JR, Hoekstra HE: Combining population genomics and quantitative genetics: finding the genes underlying ecologically important traits. Heredity 2008, 100:158-170.
  • [11]Beaumont MA, Nichols RA: Evaluating loci for use in the genetic analysis of population structure. Proc R Soc Lond B 1996, 263:1619-1626.
  • [12]Beaumont MA, Balding DJ: Identifying adaptive genetic divergence among populations from genome scans. Mol Ecol 2004, 13:969-980.
  • [13]Foll M, Gaggiotti O: A genome-scan method to identify selected loci appropriate for both dominant and codominant markers: a Bayesian perspective. Genetics 2008, 180:977-993.
  • [14]Excoffier L, Hofer T, Foll M: Detecting loci under selection in a hierarchically structured population. Heredity 2009, 103:285-298.
  • [15]Akey JM, Zhang G, Zhang K, Jin L, Shriver MD: Interrogating a high-density SNP map for signatures of natural selection. Genome Res 2002, 12:1805-1814.
  • [16]Turner TL, Hahn MW, Nuzhdin SV: Genomic islands of speciation in Anopheles gambiae. PLoS Biol 2005, 3:e285.
  • [17]Hohenlohe PA, Bassham S, Etter PD, Stiffler N, Johnson EA, Cresko WA: Population genomics of parallel adaptation in threespine stickleback using sequenced RAD tags. PLoS Genet 2010, 6:e1000862.
  • [18]Lawniczak MKN, Emrich SJ, Holloway AK, Regier AP, Olson M, White B, Redmond S, Fulton L, Appelbaum E, Godfrey J, et al.: Widespread divergence between incipient Anopheles gambiae species revealed by whole genome sequences. Science 2010, 330:512-514.
  • [19]Weir BS, Cockerham CC: Estimating F-statistics for the analysis of population-structure. Evolution 1984, 38:1358-1370.
  • [20]Ewens W: Mathematical population genetics. Springer, New York; 1979.
  • [21]Hermisson J, Pennings PS: Soft sweeps: molecular population genetics of adaptation from standing genetic variation. Genetics 2005, 169:2335-2352.
  • [22]Nei M, Tajima F: DNA polymorphism detectable by restriction endonucleases. Genetics 1981, 97:145-163.
  • [23]Nielsen R, Paul JS, Albrechtsen A, Song YS: Genotype and SNP calling from next-generation sequencing data. Nat Rev Genet 2011, 12:443-451.
  • [24]Marth GT, Czabarka E, Murvai J, Sherry ST: The allele frequency spectrum in genome-wide human variation data reveals signals of differential demographic history in three large world populations. Genetics 2004, 166:351-372.
  • [25]Keightley PD, Eyre-Walker A: Joint inference of the distribution of fitness effects of deleterious mutations and population demography based on nucleotide polymorphism frequencies. Genetics 2007, 177:2251-2261.
  • [26]Oleksyk TK, Zhao K, De La Vega FM, Gilbert DA, O'Brien SJ, Smith MW: Identifying selected regions from heterozygosity and divergence using a light-coverage genomic dataset from two human populations. PLoS One 2008, 3:e1712.
  • [27]Bradbury IR, Hubert S, Higgins B, Borza T, Bowman S, Paterson IG, Snelgrove PVR, Morris CJ, Gregory RS, Hardie DC, et al.: Parallel adaptive evolution of Atlantic cod on both sides of the Atlantic Ocean in response to temperature. Proc R Soc B 2010, 277:3725-3734.
  • [28]Mardis ER: The impact of next-generation sequencing technology on genetics. Trends Genet 2008, 24:133-141.
  • [29]Baird NA, Etter PD, Atwood TS, Currey MC, Shiver AL, Lewis ZA, Selker EU, Cresko WA, Johnson EA: Rapid SNP discovery and genetic mapping using sequenced RAD markers. PLoS One 2008, 3:e3376.
  • [30]Berner D, Roesti M, Hendry AP, Salzburger W: Constraints on speciation suggested by comparing lake-stream stickleback divergence across two continents. Mol Ecol 2010, 19:4963-4978.
  • [31]Berner D, Grandchamp A-C, Hendry AP: Variable progress toward ecological speciation in parapatry: stickleback across eight lake-stream transitions. Evolution 2009, 63:1740-1753.
  • [32]Berner D, Adams DC, Grandchamp A-C, Hendry AP: Natural selection drives patterns of lake-stream divergence in stickleback foraging morphology. J Evol Biol 2008, 21:1653-1665.
  • [33]Li H, Handsaker B, Wysoker A, Fennell T, Ruan J, Homer N, Marth G, Abecasis G, Durbin R: Genome Project Data P: The Sequence Alignment/Map format and SAMtools. Bioinformatics 2009, 25:2078-2079.
  • [34]R Development Team: R: A language and environment for statistical computing. R Foundation for Statistical Computing, Vienna, Austria; 2010.
  • [35]Morgan M, Anders S, Lawrence M, Aboyoun P, Pagès H, Gentleman R: ShortRead: a bioconductor package for input, quality assessment and exploration of high-throughput sequence data. Bioinformatics 2009, 25:2607-2608.
  • [36]Harpending HC, Batzer MA, Gurven M, Jorde LB, Rogers AR, Sherry ST: Genetic traces of ancient demography. Proc Nat Acad Sci USA 1998, 95:1961-1967.
  • [37]Barreiro LB, Laval G, Quach H, Patin E, Quintana-Murci L: Natural selection has driven population differentiation in modern humans. Nat Genet 2008, 40:340-345.
  • [38]Geiler KA, Harrison RG: A Δ11 desaturase gene genealogy reveals two divergent allelic classes within the European corn borer (Ostrinia nubilalis). BMC Evol Biol 2010, 10:112. BioMed Central Full Text
  • [39]Keightley PD, Halligan DL: Inference of site frequency spectra from high-throughput sequence data: quantification of selection on nonsynonymous and synonymous sites in humans. Genetics 2011, 188:931-940.
  • [40]Nei M, Li W-H: The transient distribution of allele frequencies under mutation pressure. Genet Res 1976, 28:205-214.
  • [41]Li WH: Maintenance of genetic variability under joint effect of mutation, selection and random drift. Genetics 1978, 90:349-382.
  • [42]Fu YX: Statistical properties of segregating sites. Theor Pop Biol 1995, 48:172-197.
  • [43]Eberle MA, Kruglyak L: An analysis of strategies for discovery of single-nucleotide polymorphisms. Genet Epidemiol 2000, 19:29-35.
  • [44]Evans SN, Shvets Y, Slatkin M: Non-equilibrium theory of the allele frequency spectrum. Theor Pop Biol 2007, 71:109-119.
  • [45]Roesti M, Hendry AP, Salzburger W, Berner D: Genome divergence during evolutionary diversification as revealed in replicate lake-stream stickleback population pairs. Mol Ecol 2012, 21:2852-2862.
  文献评价指标  
  下载次数:27次 浏览次数:12次