期刊论文详细信息
PeerJ
An empirical examination of sample size effects on population demographic estimates in birds using single nucleotide polymorphism (SNP) data
article
Jessica F. McLaughlin1  Kevin Winker1 
[1] University of Alaska Museum & Department of Biology and Wildlife, University of Alaska Fairbanks;Sam Noble Oklahoma Museum of Natural History and Department of Biology, University of Oklahoma
关键词: Population genomics;    Sample size;    Migration;    Effective population size;    Divergence with gene flow;   
DOI  :  10.7717/peerj.9939
学科分类:社会科学、人文和艺术(综合)
来源: Inra
PDF
【 摘 要 】

Sample size is a critical aspect of study design in population genomics research, yet few empirical studies have examined the impacts of small sample sizes. We used datasets from eight diverging bird lineages to make pairwise comparisons at different levels of taxonomic divergence (populations, subspecies, and species). Our data are from loci linked to ultraconserved elements and our analyses used one single nucleotide polymorphism per locus. All individuals were genotyped at all loci, effectively doubling sample size for coalescent analyses. We estimated population demographic parameters (effective population size, migration rate, and time since divergence) in a coalescent framework using Diffusion Approximation for Demographic Inference, an allele frequency spectrum method. Using divergence-with-gene-flow models optimized with full datasets, we subsampled at sequentially smaller sample sizes from full datasets of 6–8 diploid individuals per population (with both alleles called) down to 1:1, and then we compared estimates and their changes in accuracy. Accuracy was strongly affected by sample size, with considerable differences among estimated parameters and among lineages. Effective population size parameters (ν) tended to be underestimated at low sample sizes (fewer than three diploid individuals per population, or 6:6 haplotypes in coalescent terms). Migration (m) was fairly consistently estimated until <2 individuals per population, and no consistent trend of over-or underestimation was found in either time since divergence (T) or theta (Θ = 4Nrefμ). Lineages that were taxonomically recognized above the population level (subspecies and species pairs; that is, deeper divergences) tended to have lower variation in scaled root mean square error of parameter estimation at smaller sample sizes than population-level divergences, and many parameters were estimated accurately down to three diploid individuals per population. Shallower divergence levels (i.e., populations) often required at least five individuals per population for reliable demographic inferences using this approach. Although divergence levels might be unknown at the outset of study design, our results provide a framework for planning appropriate sampling and for interpreting results if smaller sample sizes must be used.

【 授权许可】

CC BY   

【 预 览 】
附件列表
Files Size Format View
RO202307100007522ZK.pdf 2515KB PDF download
  文献评价指标  
  下载次数:1次 浏览次数:5次