• 已选条件:
  • × Kevin A. T. Silverstein
  • × BMC Genomics
  • × 2017
 全选  【符合条件的数据共:9条】

BMC Genomics,2017年

Robert M. Stupar, Peng Zhou, Nevin D. Young, Peter Tiffin, Michael J. Sadowsky, Jason R. Miller, Kevin A. T. Silverstein, Karen M. Moll, Nicholas P. Devitt, Diego Fajardo, Joann Mudge, Thiruvarangan Ramaraj

LicenseType:Unknown |

预览  |  原文链接  |  全文  [ 浏览:0 下载:0  ]    

BMC Genomics,2017年

Robert M. Stupar, Junqi Liu, Peter Tiffin, Joseph Guhlin, Nevin D. Young, Peng Zhou, Roxanne Denny, Jason R. Miller, Joann Mudge, Thiruvarangan Ramaraj, Andrew D. Farmer, Kelly P. Steele, Kevin A. T. Silverstein

LicenseType:Unknown |

预览  |  原文链接  |  全文  [ 浏览:0 下载:0  ]    

BMC Genomics,2017年

James Gurtowski, W. Richard McCombie, Robert M. Stupar, Junqi Liu, Li Song, Nevin D. Young, Peter Tiffin, Peng Zhou, Roxanne Denny, Michael C. Schatz, Jason R. Miller, Kevin A. T. Silverstein, Joann Mudge, Thiruvarangan Ramaraj, Brian P. Walenz, Susan R. McCouch, Namrata Singh, Lyza G. Maron, Hayan Lee

LicenseType:Unknown |

预览  |  原文链接  |  全文  [ 浏览:0 下载:0  ]    

BMC Genomics,2017年

Robert M. Stupar, Junqi Liu, Peter Tiffin, Joseph Guhlin, Roxanne Denny, Peng Zhou, Nevin D. Young, Jason R. Miller, Andrew D. Farmer, Thiruvarangan Ramaraj, Joann Mudge, Kelly P. Steele, Kevin A. T. Silverstein

LicenseType:CC BY |

预览  |  原文链接  |  全文  [ 浏览:0 下载:0  ]    

BackgroundPrevious studies exploring sequence variation in the model legume, Medicago truncatula, relied on mapping short reads to a single reference. However, read-mapping approaches are inadequate to examine large, diverse gene families or to probe variation in repeat-rich or highly divergent genome regions. De novo sequencing and assembly of M. truncatula genomes enables near-comprehensive discovery of structural variants (SVs), analysis of rapidly evolving gene families, and ultimately, construction of a pan-genome.ResultsGenome-wide synteny based on 15 de novo M. truncatula assemblies effectively detected different types of SVs indicating that as much as 22% of the genome is involved in large structural changes, altogether affecting 28% of gene models. A total of 63 million base pairs (Mbp) of novel sequence was discovered, expanding the reference genome space for Medicago by 16%. Pan-genome analysis revealed that 42% (180 Mbp) of genomic sequences is missing in one or more accession, while examination of de novo annotated genes identified 67% (50,700) of all ortholog groups as dispensable – estimates comparable to recent studies in rice, maize and soybean. Rapidly evolving gene families typically associated with biotic interactions and stress response were found to be enriched in the accession-specific gene pool. The nucleotide-binding site leucine-rich repeat (NBS-LRR) family, in particular, harbors the highest level of nucleotide diversity, large effect single nucleotide change, protein diversity, and presence/absence variation. However, the leucine-rich repeat (LRR) and heat shock gene families are disproportionately affected by large effect single nucleotide changes and even higher levels of copy number variation.ConclusionsAnalysis of multiple M. truncatula genomes illustrates the value of de novo assemblies to discover and describe structural variation, something that is often under-estimated when using read-mapping approaches. Comparisons among the de novo assemblies also indicate that different large gene families differ in the architecture of their structural variation.

    BMC Genomics,2017年

    Robert M. Stupar, Peng Zhou, Nevin D. Young, Peter Tiffin, Michael J. Sadowsky, Jason R. Miller, Kevin A. T. Silverstein, Nicholas P. Devitt, Thiruvarangan Ramaraj, Diego Fajardo, Joann Mudge, Karen M. Moll

    LicenseType:CC BY |

    预览  |  原文链接  |  全文  [ 浏览:0 下载:0  ]    

    BackgroundThird generation sequencing technologies, with sequencing reads in the tens- of kilo-bases, facilitate genome assembly by spanning ambiguous regions and improving continuity. This has been critical for plant genomes, which are difficult to assemble due to high repeat content, gene family expansions, segmental and tandem duplications, and polyploidy. Recently, high-throughput mapping and scaffolding strategies have further improved continuity. Together, these long-range technologies enable quality draft assemblies of complex genomes in a cost-effective and timely manner.ResultsHere, we present high quality genome assemblies of the model legume plant, Medicago truncatula (R108) using PacBio, Dovetail Chicago (hereafter, Dovetail) and BioNano technologies. To test these technologies for plant genome assembly, we generated five assemblies using all possible combinations and ordering of these three technologies in the R108 assembly. While the BioNano and Dovetail joins overlapped, they also showed complementary gains in continuity and join numbers. Both technologies spanned repetitive regions that PacBio alone was unable to bridge. Combining technologies, particularly Dovetail followed by BioNano, resulted in notable improvements compared to Dovetail or BioNano alone. A combination of PacBio, Dovetail, and BioNano was used to generate a high quality draft assembly of R108, a M. truncatula accession widely used in studies of functional genomics. As a test for the usefulness of the resulting genome sequence, the new R108 assembly was used to pinpoint breakpoints and characterize flanking sequence of a previously identified translocation between chromosomes 4 and 8, identifying more than 22.7 Mb of novel sequence not present in the earlier A17 reference assembly.ConclusionsAdding Dovetail followed by BioNano data yielded complementary improvements in continuity over the original PacBio assembly. This strategy proved efficient and cost-effective for developing a quality draft assembly compared to traditional reference assemblies.

      BMC Genomics,2017年

      James Gurtowski, W. Richard McCombie, Robert M. Stupar, Junqi Liu, Li Song, Peter Tiffin, Nevin D. Young, Peng Zhou, Roxanne Denny, Michael C. Schatz, Jason R. Miller, Kevin A. T. Silverstein, Thiruvarangan Ramaraj, Joann Mudge, Brian P. Walenz, Lyza G. Maron, Namrata Singh, Susan R. McCouch, Hayan Lee

      LicenseType:CC BY |

      预览  |  原文链接  |  全文  [ 浏览:0 下载:0  ]    

      BackgroundLong-read and short-read sequencing technologies offer competing advantages for eukaryotic genome sequencing projects. Combinations of both may be appropriate for surveys of within-species genomic variation.MethodsWe developed a hybrid assembly pipeline called “Alpaca” that can operate on 20X long-read coverage plus about 50X short-insert and 50X long-insert short-read coverage. To preclude collapse of tandem repeats, Alpaca relies on base-call-corrected long reads for contig formation.ResultsCompared to two other assembly protocols, Alpaca demonstrated the most reference agreement and repeat capture on the rice genome. On three accessions of the model legume Medicago truncatula, Alpaca generated the most agreement to a conspecific reference and predicted tandemly repeated genes absent from the other assemblies.ConclusionOur results suggest Alpaca is a useful tool for investigating structural and copy number variation within de novo assemblies of sampled populations.