BMC Bioinformatics,2021年
Wei Zhang, Hao Wen, Jing Li, Lu Wang, Haoyu Wang, Jing Zhang, Ting Sun, Lin Li, Yufei He, Zixuan Xiao, Yong Liu, Wendong Li, Yifan Chen, Guang Liu, Yubo Fan, Xiaohan Han
LicenseType:CC BY |
2 MethHaplo: combining allele-specific DNA methylation and SNPs for haplotype region identification [期刊论文]
BMC Bioinformatics,2020年
Wing-Kin Sung, Ze Wang, Jing Li, Qiangwei Zhou, Guoliang Li
LicenseType:CC BY |
BMC Bioinformatics,2021年
Wen Chen, Jing Li, Bang-Zhen Pan, Mingyong Tang, Maosheng Chen, Changning Liu, Zeng-Fu Xu, Xuan Zhang
LicenseType:CC BY |
BMC Bioinformatics,2015年
Ke Hu, Jing Li, Angela H. Ting
LicenseType:Unknown |
BackgroundBisulfite sequencing is one of the most widely used technologies in analyzing DNA methylation patterns, which are important in understanding and characterizing the mechanism of DNA methylation and its functions in disease development. Efficient and user-friendly tools are critical in carrying out such analysis on high-throughput bisulfite sequencing data. However, existing tools are either not scalable well, or inadequate in providing visualization and other desirable functionalities.ResultsIn order to handle ultra large sequencing data and to provide additional functions and features, we have developed BSPAT, a fast online tool for bisulfite sequencing pattern analysis. With a user-friendly web interface, BSPAT seamlessly integrates read mapping/quality control/methylation calling with methylation pattern generation and visualization. BSPAT has the following important features: 1) instead of using multiple/pairwise sequence alignment methods, BSPAT adopts an efficient and widely used sequence mapping tool to provide fast alignment of sequence reads; 2) BSPAT summarizes and visualizes DNA methylation co-occurrence patterns at a single nucleotide level, which provide valuable information in understanding the mechanism and regulation of DNA methylation; 3) based on methylation co-occurrence patterns, BSPAT can automatically detect potential allele-specific methylation (ASM) patterns, which can greatly enhance the detection and analysis of ASM patterns; 4) by linking directly with other popular databases and tools, BSPAT allows users to perform integrative analysis of methylation patterns with other genomic features together within regions of interest.ConclusionBy utilizing a real bisulfite sequencing dataset generated from prostate cancer cell lines, we have shown that BSPAT is highly efficient. It has also reported some interesting methylation co-occurrence patterns and a potential allele-specific methylation case. In conclusion, BSPAT is an efficient and convenient tool for high-throughput bisulfite sequencing data analysis that can be broadly used.
BMC Bioinformatics,2014年
Jack Y Yang, William Yang, Qingzhong Liu, Zhongxue Chen, Jing Li, Mary Qu Yang
LicenseType:CC BY |
BackgroundCombining information from different studies is an important and useful practice in bioinformatics, including genome-wide association study, rare variant data analysis and other set-based analyses. Many statistical methods have been proposed to combine p-values from independent studies. However, it is known that there is no uniformly most powerful test under all conditions; therefore, finding a powerful test in specific situation is important and desirable.ResultsIn this paper, we propose a new statistical approach to combining p-values based on gamma distribution, which uses the inverse of the p-value as the shape parameter in the gamma distribution.ConclusionsSimulation study and real data application demonstrate that the proposed method has good performance under some situations.
BMC Bioinformatics,2010年
Xin Li, Jing Li, Yixuan Chen
LicenseType:Unknown |
BackgroundHaplotype-based approaches have been extensively studied for case-control association mapping in recent years. It has been shown that haplotype methods can provide more consistent results comparing to single-locus based approaches, especially in cases where causal variants are not typed. Improved power has been observed by clustering similar or rare haplotypes into groups to reduce the degrees of freedom of association tests. For family-based association studies, one commonly used strategy is Transmission Disequilibrium Tests (TDT), which examine the imbalanced transmission of alleles/haplotypes to affected and normal children. Many extensions have been developed to deal with general pedigrees and continuous traits.ResultsIn this paper, we propose a new haplotype-based association method for family data that is different from the TDT framework. Our approach (termed F_HapMiner) is based on our previous successful experiences on haplotype inference from pedigree data and haplotype-based association mapping. It first infers diplotype pairs of each individual in each pedigree assuming no recombination within a family. A phenotype score is then defined for each founder haplotype. Finally, F_HapMiner applies a clustering algorithm on those founder haplotypes based on their similarities and identifies haplotype clusters that show significant associations with diseases/traits. We have performed extensive simulations based on realistic assumptions to evaluate the effectiveness of the proposed approach by considering different factors such as allele frequency, linkage disequilibrium (LD) structure, disease model and sample size. Comparisons with single-locus and haplotype-based TDT methods demonstrate that our approach consistently outperforms the TDT-based approaches regardless of disease models, local LD structures or allele/haplotype frequencies.ConclusionWe present a novel haplotype-based association approach using family data. Experiment results demonstrate that it achieves significantly higher power than TDT-based approaches.