期刊论文详细信息
Proteome Science
Informed baseline subtraction of proteomic mass spectrometry data aided by a novel sliding window algorithm
Methodology
Patty J. Solomon1  Tyman E. Stanford1  Christopher J. Bagley1 
[1] School of Mathematical Sciences, The University of Adelaide, North Terrace, 5005, Adelaide, Australia;
关键词: Mathematical morphology;    Top-hat operator;    Line segment algorithm;    Mass spectrometry;    Baseline subtraction;    Pre-processing;    Matrix-assisted laser desorption/ionization;    Time-of-flight;    Unevenly spaced data;   
DOI  :  10.1186/s12953-016-0107-8
 received in 2016-04-24, accepted in 2016-11-01,  发布年份 2016
来源: Springer
PDF
【 摘 要 】

BackgroundProteomic matrix-assisted laser desorption/ionisation (MALDI) linear time-of-flight (TOF) mass spectrometry (MS) may be used to produce protein profiles from biological samples with the aim of discovering biomarkers for disease. However, the raw protein profiles suffer from several sources of bias or systematic variation which need to be removed via pre-processing before meaningful downstream analysis of the data can be undertaken. Baseline subtraction, an early pre-processing step that removes the non-peptide signal from the spectra, is complicated by the following: (i) each spectrum has, on average, wider peaks for peptides with higher mass-to-charge ratios (m/z), and (ii) the time-consuming and error-prone trial-and-error process for optimising the baseline subtraction input arguments. With reference to the aforementioned complications, we present an automated pipeline that includes (i) a novel ‘continuous’ line segment algorithm that efficiently operates over data with a transformed m/z-axis to remove the relationship between peptide mass and peak width, and (ii) an input-free algorithm to estimate peak widths on the transformed m/z scale.ResultsThe automated baseline subtraction method was deployed on six publicly available proteomic MS datasets using six different m/z-axis transformations. Optimality of the automated baseline subtraction pipeline was assessed quantitatively using the mean absolute scaled error (MASE) when compared to a gold-standard baseline subtracted signal. Several of the transformations investigated were able to reduce, if not entirely remove, the peak width and peak location relationship resulting in near-optimal baseline subtraction using the automated pipeline. The proposed novel ‘continuous’ line segment algorithm is shown to far outperform naive sliding window algorithms with regard to the computational time required. The improvement in computational time was at least four-fold on real MALDI TOF-MS data and at least an order of magnitude on many simulated datasets.ConclusionsThe advantages of the proposed pipeline include informed and data specific input arguments for baseline subtraction methods, the avoidance of time-intensive and subjective piecewise baseline subtraction, and the ability to automate baseline subtraction completely. Moreover, individual steps can be adopted as stand-alone routines.

【 授权许可】

CC BY   
© The Author(s) 2016

【 预 览 】
附件列表
Files Size Format View
RO202311103326891ZK.pdf 2172KB PDF download
Fig. 7 1484KB Image download
MediaObjects/13046_2022_2359_MOESM2_ESM.docx 15KB Other download
Fig. 1 1997KB Image download
Fig. 3 3150KB Image download
Fig. 1 67KB Image download
12936_2016_1182_Article_IEq39.gif 1KB Image download
12951_2017_255_Article_IEq41.gif 1KB Image download
MediaObjects/13046_2023_2862_MOESM5_ESM.png 235KB Other download
Fig. 1 118KB Image download
Fig. 2 530KB Image download
MediaObjects/41408_2023_927_MOESM3_ESM.tif 2072KB Other download
Fig. 3 424KB Image download
Fig. 1 4104KB Image download
Fig. 5 4247KB Image download
【 图 表 】

Fig. 5

Fig. 1

Fig. 3

Fig. 2

Fig. 1

12951_2017_255_Article_IEq41.gif

12936_2016_1182_Article_IEq39.gif

Fig. 1

Fig. 3

Fig. 1

Fig. 7

【 参考文献 】
  • [1]
  • [2]
  • [3]
  • [4]
  • [5]
  • [6]
  • [7]
  • [8]
  • [9]
  • [10]
  • [11]
  • [12]
  • [13]
  • [14]
  • [15]
  • [16]
  • [17]
  • [18]
  • [19]
  • [20]
  • [21]
  • [22]
  • [23]
  • [24]
  • [25]
  • [26]
  • [27]
  • [28]
  • [29]
  • [30]
  • [31]
  • [32]
  • [33]
  • [34]
  • [35]
  • [36]
  • [37]
  • [38]
  • [39]
  • [40]
  • [41]
  • [42]
  • [43]
  • [44]
  • [45]
  • [46]
  • [47]
  • [48]
  • [49]
  文献评价指标  
  下载次数:17次 浏览次数:2次