BMC Genomics,2019年
Huan Liu, Ruiliang Zhu, Bernard Goffinet, Shouzhou Zhang, Shanshan Dong, Yang Liu, Chaoxian Zhao, Li Zhang, Hong Wu, Yu Jia
LicenseType:Unknown |
BMC Genomics,2019年
Dan Long, Fang-ling Du, Tian-xin Xiang, La-Gen Wan, Dan-dan Wei, Yang Liu, Wei Zhang, Lan-lan Zhu
LicenseType:Unknown |
BMC Genomics,2019年
Yongjun He, Lu Zhou, Huoying Chen, Yang Liu, Hang Chen
LicenseType:Unknown |
BMC Genomics,2023年
Yongfei Peng, Jianwei Ye, Jichao Tian, Qiuwen Hu, Xiaoqi Wang, Yadong Yang, Yilong Wang, Yong Jiang, Zixiao Li, Xin Qiu, Xia Meng, Jinxi Lin, Yang Liu, Hao Li, Yanfeng Shi, Zhe Xu, Si Cheng, Yongjun Wang
LicenseType:CC BY |
BackgroundIn large-scale high-throughput sequencing projects and biobank construction, sample tagging is essential to prevent sample mix-ups. Despite the availability of fingerprint panels for DNA data, little research has been conducted on sample tagging of whole genome bisulfite sequencing (WGBS) data. This study aims to construct a pipeline and identify applicable fingerprint panels to address this problem.ResultsUsing autosome-wide A/T polymorphic single nucleotide variants (SNVs) obtained from whole genome sequencing (WGS) and WGBS of individuals from the Third China National Stroke Registry, we designed a fingerprint panel and constructed an optimized pipeline for tagging WGBS data. This pipeline used Bis-SNP to call genotypes from the WGBS data, and optimized genotype comparison by eliminating wildtype homozygous and missing genotypes, and retaining variants with identical genomic coordinates and reference/alternative alleles. WGS-based and WGBS-based genotypes called from identical or different samples were extensively compared using hap.py. In the first batch of 94 samples, the genotype consistency rates were between 71.01%-84.23% and 51.43%-60.50% for the matched and mismatched WGS and WGBS data using the autosome-wide A/T polymorphic SNV panel. This capability to tag WGBS data was validated among the second batch of 240 samples, with genotype consistency rates ranging from 70.61%-84.65% to 49.58%-61.42% for the matched and mismatched data, respectively. We also determined that the number of genetic variants required to correctly tag WGBS data was on the order of thousands through testing six fingerprint panels with different orders for the number of variants. Additionally, we affirmed this result with two self-designed panels of 1351 and 1278 SNVs, respectively. Furthermore, this study confirmed that using the number of genetic variants with identical coordinates and ref/alt alleles, or identical genotypes could not correctly tag WGBS data.ConclusionThis study proposed an optimized pipeline, applicable fingerprint panels, and a lower boundary for the number of fingerprint genetic variants needed for correct sample tagging of WGBS data, which are valuable for tagging WGBS data and integrating multi-omics data for biobanks.
5 Evaluation of genetic variation among Brazilian soybean cultivars through genome resequencing [期刊论文]
BMC Genomics,2016年
Marcelo Fernandes de Oliveira, Francismar Corrêa Marcelino-Guimarães, João Vitor Maldonado dos Santos, Ricardo Vilela Abdelnoor, Yang Liu, Juexin Wang, Saad M. Khan, Trupti Joshi, Dong Xu, Babu Valliyodan, Tri D. Vuong, Henry T. Nguyen
LicenseType:CC BY |
BackgroundSoybean [Glycine max (L.) Merrill] is one of the most important legumes cultivated worldwide, and Brazil is one of the main producers of this crop. Since the sequencing of its reference genome, interest in structural and allelic variations of cultivated and wild soybean germplasm has grown. To investigate the genetics of the Brazilian soybean germplasm, we selected soybean cultivars based on the year of commercialization, geographical region and maturity group and resequenced their genomes.ResultsWe resequenced the genomes of 28 Brazilian soybean cultivars with an average genome coverage of 14.8X. A total of 5,835,185 single nucleotide polymorphisms (SNPs) and 1,329,844 InDels were identified across the 20 soybean chromosomes, with 541,762 SNPs, 98,922 InDels and 1,093 CNVs that were exclusive to the 28 Brazilian cultivars. In addition, 668 allelic variations of 327 genes were shared among all of the Brazilian cultivars, including genes related to DNA-dependent transcription-elongation, photosynthesis, ATP synthesis-coupled electron transport, cellular respiration, and precursors of metabolite generation and energy. A very homogeneous structure was also observed for the Brazilian soybean germplasm, and we observed 41 regions putatively influenced by positive selection. Finally, we detected 3,880 regions with copy-number variations (CNVs) that could help to explain the divergence among the accessions evaluated.ConclusionsThe large number of allelic and structural variations identified in this study can be used in marker-assisted selection programs to detect unique SNPs for cultivar fingerprinting. The results presented here suggest that despite the diversification of modern Brazilian cultivars, the soybean germplasm remains very narrow because of the large number of genome regions that exhibit low diversity. These results emphasize the need to introduce new alleles to increase the genetic diversity of the Brazilian germplasm.
BMC Genomics,2013年
Dong Dong, Yang Liu, Shuyi Zhang, Ming Lei
LicenseType:CC BY |
BackgroundBats have aroused great interests of researchers for the sake of their advanced echolocation system. However, this highly specialized trait is not characteristic of Old World fruit bats.ResultsTo comprehensively explore the underlying molecular basis between echolocating and non-echolocating bats, we employed a sequence-based approach to compare the inner ear expression difference between the Rickett’s big-footed bat (Myotis ricketti, echolocating bat) and the Greater short-nosed fruit bat (Cynopterus sphinx, non-echolocating bat). De novo sequence assemblies were developed for both species. The results showed that the biological implications of up-regulated genes in M. ricketti were significantly over-represented in biological process categories such as ‘cochlea morphogenesis’, ‘inner ear morphogenesis’ and ‘sensory perception of sound’, which are consistent with the inner ear morphological and physiological differentiation between the two bat species. Moreover, the expression of TMC1 gene confirmed its important function in echolocating bats.ConclusionOur work presents the first transcriptome comparison between echolocating and non-echolocating bats, and provides information about the genetic basis of their distinct hearing traits.