期刊论文详细信息
Applied Sciences
PICCOLO White-Light and Narrow-Band Imaging Colonoscopic Dataset: A Performance Comparative of Models and Datasets
Francisco Polo1  Roberto Bilbao1  Cristina L. Saratxaga2  Ben Glover2  Ángel José Calderón3  Nagore Andraka3  Francisco M. Sánchez-Margallo4  Luisa F. Sánchez-Peralta5  J. Blas Pagador5  Artzai Picón6 
[1] Avenida Montevideo, 18. E-48013 Bilbao, Spain;Basque Biobank, Basque Foundation for Health Innovation and Research-BIOEF, Ronda de Azkue 1, E-48902 Barakaldo, Spain;Gastroenterology Department, Hospital Universitario Basurto;Imperial College London, Exhibition Road, South Kensington, London SW7 2BU, UK;Jesús Usón Minimally Invasive Surgery Centre, N-521, km 41.7, E-10071 Cáceres, Spain;TECNALIA, Basque Research and Technology Alliance (BRTA), Parque Tecnológico de Bizkaia, C/Geldo. Edificio 700, E-48160 Derio-Bizkaia, Spain;
关键词: deep learning;    colorectal cancer;    public dataset;    clinical metadata;    colonoscopy;    binary masks;   
DOI  :  10.3390/app10238501
来源: DOAJ
【 摘 要 】

Colorectal cancer is one of the world leading death causes. Fortunately, an early diagnosis allows for effective treatment, increasing the survival rate. Deep learning techniques have shown their utility for increasing the adenoma detection rate at colonoscopy, but a dataset is usually required so the model can automatically learn features that characterize the polyps. In this work, we present the PICCOLO dataset, that comprises 3433 manually annotated images (2131 white-light images 1302 narrow-band images), originated from 76 lesions from 40 patients, which are distributed into training (2203), validation (897) and test (333) sets assuring patient independence between sets. Furthermore, clinical metadata are also provided for each lesion. Four different models, obtained by combining two backbones and two encoder–decoder architectures, are trained with the PICCOLO dataset and other two publicly available datasets for comparison. Results are provided for the test set of each dataset. Models trained with the PICCOLO dataset have a better generalization capacity, as they perform more uniformly along test sets of all datasets, rather than obtaining the best results for its own test set. This dataset is available at the website of the Basque Biobank, so it is expected that it will contribute to the further development of deep learning methods for polyp detection, localisation and classification, which would eventually result in a better and earlier diagnosis of colorectal cancer, hence improving patient outcomes.

【 授权许可】

Unknown   

  文献评价指标  
  下载次数:0次 浏览次数:0次