Frontiers in Genetics | |
Genome-Wide SNP Discovery in Indigenous Cattle Breeds of South Africa | |
Avhashoni A. Zwane2  Este Van Marle-Koster2  Mahlako Linah Makgahlela3  Azwihangwisi Maiwashe3  Jeremy F. Taylor4  Jesse Hoff4  Robert D. Schnabel5  Ananyo Choudhury6  | |
[1] Department of Animal Breeding and Genetics, Agricultural Research Council-Animal Production, Irene, South Africa;Department of Animal and Wildlife Sciences, University of Pretoria, Pretoria, South Africa;Department of Animal, Wildlife and Grassland Sciences, University of the Free State, Bloemfontein, South Africa;Division of Animal Sciences, University of Missouri, Columbia, MO, United States;Informatics Institute, University of Missouri, Columbia, MO, United States;Sydney Brenner Institute of Molecular Bioscience, University of the Witwatersrand, Johannesburg, South Africa; | |
关键词: indigenous breeds; sequencing; novel variants; annotation; genes; | |
DOI : 10.3389/fgene.2019.00273 | |
来源: DOAJ |
【 摘 要 】
Single nucleotide polymorphism arrays have created new possibilities for performing genome-wide studies to detect genomic regions harboring sequence variants that affect complex traits. However, the majority of validated SNPs for which allele frequencies have been estimated are limited primarily to European breeds. The objective of this study was to perform SNP discovery in three South African indigenous breeds (Afrikaner, Drakensberger, and Nguni) using whole genome sequencing. DNA was extracted from blood and hair samples, quantified and prepared at 50 ng/μl concentration for sequencing at the Agricultural Research Council Biotechnology Platform using an Illumina HiSeq 2500. The fastq files were used to call the variants using the Genome Analysis Tool Kit. A total of 1,678,360 were identified as novel using Run 6 of 1000 Bull Genomes Project. Annotation of the identified variants classified them into functional categories. Within the coding regions, about 30% of the SNPs were non-synonymous substitutions that encode for alternate amino acids. The study of distribution of SNP across the genome identified regions showing notable differences in the densities of SNPs among the breeds and highlighted many regions of functional significance. Gene ontology terms identified genes such as MLANA, SYT10, and CDC42EP5 that have been associated with coat color in mouse, and ADAMS3, DNAJC3, and PAG5 genes have been associated with fertility in cattle. Further analysis of the variants detected 688 candidate selective sweeps (ZHp Z-scores ≤ -4) across all three breeds, of which 223 regions were assigned as being putative selective sweeps (ZHp scores ≤-5). We also identified 96 regions with extremely low ZHp Z-scores (≤-6) in Afrikaner and Nguni. Genes such as KIT and MITF that have been associated with skin pigmentation in cattle and CACNA1C, which has been associated with biopolar disorder in human, were identified in these regions. This study provides the first analysis of sequence data to discover SNPs in indigenous South African cattle breeds. The information will play an important role in our efforts to understand the genetic history of our cattle and in designing appropriate breed improvement programmes.
【 授权许可】
Unknown