期刊论文详细信息
BMC Bioinformatics
Canary: an atomic pipeline for clinical amplicon assays
Software
Piers Blombery1  Georgina Ryland1  Andrew Fellowes1  Ella R. Thompson2  Stephen B. Fox3  Jason Ellul4  Kenneth D. Doig5  Anthony T. Papenfuss6 
[1] Department of Pathology, Peter MacCallum Cancer Centre, East Melbourne, VIC, Australia;Department of Pathology, Peter MacCallum Cancer Centre, East Melbourne, VIC, Australia;Sir Peter MacCallum Department of Oncology, University of Melbourne, Melbourne, Australia;Department of Pathology, Peter MacCallum Cancer Centre, East Melbourne, VIC, Australia;Sir Peter MacCallum Department of Oncology, University of Melbourne, Melbourne, Australia;Department of Pathology, University of Melbourne, Melbourne, Australia;Research Division, Peter MacCallum Cancer Centre, East Melbourne, VIC, Australia;Research Division, Peter MacCallum Cancer Centre, East Melbourne, VIC, Australia;Department of Pathology, Peter MacCallum Cancer Centre, East Melbourne, VIC, Australia;Sir Peter MacCallum Department of Oncology, University of Melbourne, Melbourne, Australia;Research Division, Peter MacCallum Cancer Centre, East Melbourne, VIC, Australia;Sir Peter MacCallum Department of Oncology, University of Melbourne, Melbourne, Australia;Department of Medical Biology, University of Melbourne, Melbourne, Australia;Bioinformatics Division, The Walter and Eliza Hall Institute of Medical Research, Parkville, VIC, Australia;
关键词: Targeted sequencing;    Canary;    PathOS;    Pipelines;    Clinical diagnostics;    Variant calling;    Amplicon;   
DOI  :  10.1186/s12859-017-1950-z
 received in 2017-05-25, accepted in 2017-11-22,  发布年份 2017
来源: Springer
PDF
【 摘 要 】

BackgroundHigh throughput sequencing requires bioinformatics pipelines to process large volumes of data into meaningful variants that can be translated into a clinical report. These pipelines often suffer from a number of shortcomings: they lack robustness and have many components written in multiple languages, each with a variety of resource requirements. Pipeline components must be linked together with a workflow system to achieve the processing of FASTQ files through to a VCF file of variants. Crafting these pipelines requires considerable bioinformatics and IT skills beyond the reach of many clinical laboratories.ResultsHere we present Canary, a single program that can be run on a laptop, which takes FASTQ files from amplicon assays through to an annotated VCF file ready for clinical analysis. Canary can be installed and run with a single command using Docker containerization or run as a single JAR file on a wide range of platforms. Although it is a single utility, Canary performs all the functions present in more complex and unwieldy pipelines. All variants identified by Canary are 3′ shifted and represented in their most parsimonious form to provide a consistent nomenclature, irrespective of sequencing variation. Further, proximate in-phase variants are represented as a single HGVS ‘delins’ variant. This allows for correct nomenclature and consequences to be ascribed to complex multi-nucleotide polymorphisms (MNPs), which are otherwise difficult to represent and interpret. Variants can also be annotated with hundreds of attributes sourced from MyVariant.info to give up to date details on pathogenicity, population statistics and in-silico predictors.ConclusionsCanary has been used at the Peter MacCallum Cancer Centre in Melbourne for the last 2 years for the processing of clinical sequencing data. By encapsulating clinical features in a single, easily installed executable, Canary makes sequencing more accessible to all pathology laboratories.Canary is available for download as source or a Docker image at https://github.com/PapenfussLab/Canary under a GPL-3.0 License.

【 授权许可】

CC BY   
© The Author(s). 2017

【 预 览 】
附件列表
Files Size Format View
RO202311096259390ZK.pdf 965KB PDF download
【 参考文献 】
  • [1]
  • [2]
  • [3]
  • [4]
  • [5]
  • [6]
  • [7]
  • [8]
  • [9]
  • [10]
  • [11]
  • [12]
  • [13]
  • [14]
  • [15]
  • [16]
  • [17]
  • [18]
  • [19]
  • [20]
  • [21]
  • [22]
  • [23]
  • [24]
  • [25]
  • [26]
  • [27]
  • [28]
  文献评价指标  
  下载次数:4次 浏览次数:2次