BMC Bioinformatics | |
Two molecular measures of relatedness based on haplotype sharing | |
Methodology Article | |
David Edwards1  | |
[1] Centre for Quantitative Genetics and Genomics, Department of Molecular Biology and Genetics, Aarhus University, Blichers Allé 20, 8830, Tjele, DK, Denmark; | |
关键词: Genomic relationship matrix; Multiset; Acyclic probabilistic finite automata; Haplotype sharing; | |
DOI : 10.1186/s12859-015-0802-y | |
received in 2015-07-26, accepted in 2015-10-29, 发布年份 2015 | |
来源: Springer | |
【 摘 要 】
BackgroundMeasuring the extent of shared ancestry between individuals or organisms is important in many fields, including forensic science, conservation genetics and animal breeding. The traditional approach is to calculate the expected degree of relatedness between individuals in a pedigree. This assumes that the founders of the pedigree are non-inbred and unrelated to each other, which is rarely the case. In contrast, molecular data allow measurement of actual relatedness without knowledge of a pedigree. Methods to do this have been proposed, but generally do not take the lengths of the genomic regions shared between individuals into account.ResultsTwo measures based on the extent of haplotype sharing between genomes are proposed. The intercept measure B estimates the fraction of shared genome between individuals, and the product measure C is closely related to the numerator relationship matrix A. Both are based on a model for the joint distribution of markers at the haplotype level. The two measures are compared to the pedigree-based measure A and to vanRaden’s G, a frequently used molecular measure, using a set of data comprising 5037 dairy cattle. The comparison criteria include the ability to capture genealogical relatedness and the prediction accuracy obtained when used in genomic prediction. Both B and C explain around 95 % of the variation in A, whereas G explains around 6 %. G captures genealogical relatedness poorly, particularly for distantly related individuals (second cousins or farther). Both B and C tend to be larger than A but this can be ascribed to the assumption of non-inbred unrelated founders. Using C in linear mixed models results in slightly higher prediction accuracy than G, and using B results in slightly lower prediction accuracy.ConclusionsThe two proposed measures of relatedness capture genealogical relatedness well, outperforming vanRaden’s G in this respect. When used in genomic prediction models, the product measure leads to slightly improved prediction accuracy.
【 授权许可】
CC BY
© Edwards. 2015
【 预 览 】
Files | Size | Format | View |
---|---|---|---|
RO202311093254397ZK.pdf | 1138KB | download |
【 参考文献 】
- [1]
- [2]
- [3]
- [4]
- [5]
- [6]
- [7]
- [8]
- [9]
- [10]
- [11]
- [12]
- [13]
- [14]
- [15]
- [16]
- [17]
- [18]
- [19]
- [20]
- [21]
- [22]
- [23]
- [24]
- [25]
- [26]
- [27]
- [28]
- [29]
- [30]
- [31]