| BMC Medical Imaging | |
| Convolutional neural network -based phantom image scoring for mammography quality control | |
| Research | |
| Veli-Matti Sundell1  Teemu Mäkelä1  Touko Kaasalainen2  Anne-Mari Vitikainen2  | |
| [1] Department of Physics, University of Helsinki, P.O. Box 64, 00014, Helsinki, Finland;HUS Diagnostic Center, Radiology, University of Helsinki and Helsinki University Hospital, P.O. Box 340, Haartmaninkatu 4, 00290, Helsinki, Finland;HUS Diagnostic Center, Radiology, University of Helsinki and Helsinki University Hospital, P.O. Box 340, Haartmaninkatu 4, 00290, Helsinki, Finland; | |
| 关键词: Mammography; Quality control; Convolutional neural network; | |
| DOI : 10.1186/s12880-022-00944-w | |
| received in 2022-06-21, accepted in 2022-11-28, 发布年份 2022 | |
| 来源: Springer | |
PDF
|
|
【 摘 要 】
BackgroundVisual evaluation of phantom images is an important, but time-consuming part of mammography quality control (QC). Consistent scoring of phantom images over the device’s lifetime is highly desirable. Recently, convolutional neural networks (CNNs) have been applied to a wide range of image classification problems, performing with a high accuracy. The purpose of this study was to automate mammography QC phantom scoring task by training CNN models to mimic a human reviewer.MethodsEight CNN variations consisting of three to ten convolutional layers were trained for detecting targets (fibres, microcalcifications and masses) in American College of Radiology (ACR) accreditation phantom images and the results were compared with human scoring. Regular and artificially degraded/improved QC phantom images from eight mammography devices were visually evaluated by one reviewer. These images were used in training the CNN models. A separate test set consisted of daily QC images from the eight devices and separately acquired images with varying dose levels. These were scored by four reviewers and considered the ground truth for CNN performance testing.ResultsAlthough hyper-parameter search space was limited, an optimal network depth after which additional layers resulted in decreased accuracy was identified. The highest scoring accuracy (95%) was achieved with the CNN consisting of six convolutional layers. The highest deviation between the CNN and the reviewers was found at lowest dose levels. No significant difference emerged between the visual reviews and CNN results except in case of smallest masses.ConclusionA CNN-based automatic mammography QC phantom scoring system can score phantom images in a good agreement with human reviewers, and can therefore be of benefit in mammography QC.
【 授权许可】
CC BY
© The Author(s) 2022
【 预 览 】
| Files | Size | Format | View |
|---|---|---|---|
| RO202305064749436ZK.pdf | 2153KB | ||
| Fig. 1 | 68KB | Image | |
| Fig. 5 | 745KB | Image | |
| Fig. 2 | 621KB | Image | |
| 12936_2022_4386_Article_IEq179.gif | 1KB | Image |
【 图 表 】
12936_2022_4386_Article_IEq179.gif
Fig. 2
Fig. 5
Fig. 1
【 参考文献 】
- [1]
- [2]
- [3]
- [4]
- [5]
- [6]
- [7]
- [8]
- [9]
- [10]
- [11]
- [12]
- [13]
- [14]
- [15]
- [16]
- [17]
- [18]
- [19]
- [20]
- [21]
- [22]
- [23]
- [24]
- [25]
- [26]
- [27]
- [28]
- [29]
- [30]
- [31]
- [32]
- [33]
- [34]
PDF