BMC Genomics,2016年
Weixing Feng, Jin Li, Wang Cong, Yulin Deng, Lei Wang, Chengzhen Xu, Hong Liang, Yunlong Liu, Ying Wang, Yue Wang, Todd C. Skaar, Xuefeng Dai
LicenseType:CC BY |
BackgroundIn combination with gene expression profiles, the protein interaction network (PIN) constructs a dynamic network that includes multiple functional modules. Previous studies have demonstrated that rifampin can influence drug metabolism by regulating drug-metabolizing enzymes, transporters, and microRNAs (miRNAs). Rifampin induces gene expression, at least in part, by activating the pregnane X receptor (PXR), which induces gene expression; however, the impact of rifampin on global gene regulation has not been examined under the molecular network frameworks.MethodsIn this study, we extracted rifampin-induced significant differentially expressed genes (SDG) based on the gene expression profile. By integrating the SDG and human protein interaction network (HPIN), we constructed the rifampin-regulated protein interaction network (RrPIN). Based on gene expression measurements, we extracted a subnetwork that showed enriched changes in molecular activity. Using the Kyoto Encyclopedia of Genes and Genomes (KEGG), we identified the crucial rifampin-regulated biological pathways and associated genes. In addition, genes targeted by miRNAs that were significantly differentially expressed in the miRNA expression profile were extracted based on the miRNA-gene prediction tools. The miRNA-regulated PIN was further constructed using associated genes and miRNAs. For each miRNA, we further evaluated the potential impact by the gene interaction network using pathway analysis.Results and DisccussionWe extracted the functional modules, which included 84 genes and 89 interactions, from the RrPIN, and identified 19 key rifampin-response genes that are associated with seven function pathways that include drug response and metabolism, and cancer pathways; many of the pathways were supported by previous studies. In addition, we identified that a set of 6 genes (CAV1, CREBBP, SMAD3, TRAF2, KBKG, and THBS1) functioning as gene hubs in the subnetworks that are regulated by rifampin. It is also suggested that 12 differentially expressed miRNAs were associated with 6 biological pathways.ConclusionsOur results suggest that rifampin contributes to changes in the expression of genes by regulating key molecules in the protein interaction networks. This study offers valuable insights into rifampin-induced biological mechanisms at the level of miRNAs, genes and proteins.
BMC Genomics,2016年
Curt Balch, Fei Huang, A. Keith Dunker, Ao Zhou, Yunlong Liu, Meng Li, Bo He, Weixing Feng, Yue Wang, Bing Xu, Baiyan Li
LicenseType:CC BY |
BackgroundLipopolysaccharide (LPS) is a gram-negative bacterial antigen that triggers a series of cellular responses. LPS pre-conditioning was previously shown to improve the therapeutic efficacy of bone marrow stromal cells/bone-marrow derived mesenchymal stem cells (BMSCs) for repairing ischemic, injured tissue.ResultsIn this study, we systematically evaluated the effects of LPS treatment on genome-wide splicing pattern changes in mouse BMSCs by comparing transcriptome sequencing data from control vs. LPS-treated samples, revealing 197 exons whose BMSC splicing patterns were altered by LPS. Functional analysis of these alternatively spliced genes demonstrated significant enrichment of phosphoproteins, zinc finger proteins, and proteins undergoing acetylation. Additional bioinformatics analysis strongly suggest that LPS-induced alternatively spliced exons could have major effects on protein functions by disrupting key protein functional domains, protein-protein interactions, and post-translational modifications.ConclusionAlthough it is still to be determined whether such proteome modifications improve BMSC therapeutic efficacy, our comprehensive splicing characterizations provide greater understanding of the intracellular mechanisms that underlie the therapeutic potential of BMSCs.
BMC Genomics,2016年
Pengyue Zhang, Xiaoling Xuei, Yangyang Hao, Lang Li, Yunlong Liu, Howard J. Edenberg, Harikrishna Nakshatri
LicenseType:CC BY |
BackgroundSensitive detection of low-frequency single nucleotide variants carries great significance in many applications. In cancer genetics research, tumor biopsies are a mixture of normal and tumor cells from various subpopulations due to tumor heterogeneity. Thus the frequencies of somatic variants from a subpopulation tend to be low. Liquid biopsies, which monitor circulating tumor DNA in blood to detect metastatic potential, also face the challenge of detecting low-frequency variants due to the small percentage of the circulating tumor DNA in blood. Moreover, in population genetics research, although pooled sequencing of a large number of individuals is cost-effective, pooling dilutes the signals of variants from any individual. Detection of low frequency variants is difficult and can be cofounded by sequencing artifacts. Existing methods are limited in sensitivity and mainly focus on frequencies around 2 % to 5 %; most fail to consider differential sequencing artifacts.ResultsWe aimed to push down the frequency detection limit close to the position specific sequencing error rates by modeling the observed erroneous read counts with respect to genomic sequence contexts. 4 distributions suitable for count data modeling (using generalized linear models) were extensively characterized in terms of their goodness-of-fit as well as the performances on real sequencing data benchmarks, which were specifically designed for testing detection of low-frequency variants; two sequencing technologies with significantly different chemistry mechanisms were used to explore systematic errors. We found the zero-inflated negative binomial distribution generalized linear mode is superior to the other models tested, and the advantage is most evident at 0.5 % to 1 % range. This method is also generalizable to different sequencing technologies. Under standard sequencing protocols and depth given in the testing benchmarks, 95.3 % recall and 79.9 % precision for Ion Proton data, 95.6 % recall and 97.0 % precision for Illumina MiSeq data were achieved for SNVs with frequency > = 1 %, while the detection limit is around 0.5 %.ConclusionsOur method enables sensitive detection of low-frequency single nucleotide variants across different sequencing platforms and will facilitate research and clinical applications such as pooled sequencing, cancer early detection, prognostic assessment, metastatic monitoring, and relapses or acquired resistance identification.
4 Improving alignment accuracy on homopolymer regions for semiconductor-based sequencing technologies [期刊论文]
BMC Genomics,2016年
Weixing Feng, Dingkai Xue, Sen Zhao, Fengfei Song, Ziwei Li, Duojiao Chen, Bo He, Yunlong Liu, Yangyang Hao, Yadong Wang
LicenseType:CC BY |
BackgroundIon Torrent and Ion Proton are semiconductor-based sequencing technologies that feature rapid sequencing speed and low upfront and operating costs, thanks to the avoidance of modified nucleotides and optical measurements. Despite of these advantages, however, Ion semiconductor sequencing technologies suffer much reduced sequencing accuracy at the genomic loci with homopolymer repeats of the same nucleotide. Such limitation significantly reduces its efficiency for the biological applications aiming at accurately identifying various genetic variants.ResultsIn this study, we propose a Bayesian inference-based method that takes the advantage of the signal distributions of the electrical voltages that are measured for all the homopolymers of a fixed length. By cross-referencing the length of homopolymers in the reference genome and the voltage signal distribution derived from the experiment, the proposed integrated model significantly improves the alignment accuracy around the homopolymer regions.ConclusionsBesides improving alignment accuracy on homopolymer regions for semiconductor-based sequencing technologies with the proposed model, similar strategies can also be used on other high-throughput sequencing technologies that share similar limitations.
BMC Genomics,2016年
Zhongming Zhao, Lee Cooper, Kun Huang, Jianhua Ruan, Yufei Huang, Lang Li, Yunlong Liu
LicenseType:CC BY |
We summarize the 2015 International Conference on Intelligent Biology and Medicine (ICIBM 2015) and the editorial report of the supplement to BMC Genomics. The supplement includes 20 research articles selected from the manuscripts submitted to ICIBM 2015. The conference was held on November 13–15, 2015 at Indianapolis, Indiana, USA. It included eight scientific sessions, three tutorials, four keynote presentations, three highlight talks, and a poster session that covered current research in bioinformatics, systems biology, computational biology, biotechnologies, and computational medicine.
6 Comprehensive comparison of molecular portraits between cell lines and tumors in breast cancer [期刊论文]
BMC Genomics,2016年
Meng Li, Shijun Zhang, Guanglong Jiang, Lijun Cheng, Aida Yazdanparast, Sai Mounika Inavolu, Aniruddha Vikram Pawar, Yunlong Liu
LicenseType:CC BY |
BackgroundProper cell models for breast cancer primary tumors have long been the focal point in the cancer’s research. The genomic comparison between cell lines and tumors can investigate the similarity and dissimilarity and help to select right cell model to mimic tumor tissues to properly evaluate the drug reaction in vitro. In this paper, a comprehensive comparison in copy number variation (CNV), mutation, mRNA expression and protein expression between 68 breast cancer cell lines and 1375 primary breast tumors is conducted and presented.ResultsUsing whole genome expression arrays, strong correlations were observed between cells and tumors. PAM50 gene expression differentiated them into four major breast cancer subtypes: Luminal A and B, HER2amp, and Basal-like in both cells and tumors partially. Genomic CNVs patterns were observed between tumors and cells across chromosomes in general. High C > T and C > G trans-version rates were observed in both cells and tumors, while the cells had slightly higher somatic mutation rates than tumors. Clustering analysis on protein expression data can reasonably recover the breast cancer subtypes in cell lines and tumors. Although the drug-targeted proteins ER/PR and interesting mTOR/GSK3/TS2/PDK1/ER_P118 cluster had shown the consistent patterns between cells and tumor, low protein-based correlations were observed between cells and tumors. The expression consistency of mRNA verse protein between cell line and tumors reaches 0.7076. These important drug targets in breast cancer, ESR1, PGR, HER2, EGFR and AR have a high similarity in mRNA and protein variation in both tumors and cell lines. GATA3 and RP56KB1 are two promising drug targets for breast cancer. A total score developed from the four correlations among four molecular profiles suggests that cell lines, BT483, T47D and MDAMB453 have the highest similarity with tumors.ConclusionsThe integrated data from across these multiple platforms demonstrates the existence of the similarity and dissimilarity of molecular features between breast cancer tumors and cell lines. The cell lines only mirror some but not all of the molecular properties of primary tumors. The study results add more evidence in selecting cell line models for breast cancer research.