Pathogens | |
Source Attribution of Human Campylobacteriosis Using Whole-Genome Sequencing Data and Network Analysis | |
Lynda Wainaina1  Alessandra Merlotti2  Daniel Remondini2  Tine Hald3  Clementine Henri4  Patrick Murigu Kamau Njage4  | |
[1] Department of Mathematics, University of Padova, 35121 Padova, Italy;Department of Physics and Astronomy, University of Bologna, 40126 Bologna, Italy;Research Group for Foodborne Pathogens and Epidemiology, National Food Institute, Technical University of Denmark, 2800 Kgs. Lyngby, Denmark;Research Group for Genomic Epidemiology, National Food Institute, Technical University of Denmark, 2800 Kgs. Lyngby, Denmark; | |
关键词: source attribution; Campylobacter; campylobacteriosis; network analysis; whole-genome sequencing; coherence source clustering; | |
DOI : 10.3390/pathogens11060645 | |
来源: DOAJ |
【 摘 要 】
Campylobacter spp. are a leading and increasing cause of gastrointestinal infections worldwide. Source attribution, which apportions human infection cases to different animal species and food reservoirs, has been instrumental in control- and evidence-based intervention efforts. The rapid increase in whole-genome sequencing data provides an opportunity for higher-resolution source attribution models. Important challenges, including the high dimension and complex structure of WGS data, have inspired concerted research efforts to develop new models. We propose network analysis models as an accurate, high-resolution source attribution approach for the sources of human campylobacteriosis. A weighted network analysis approach was used in this study for source attribution comparing different WGS data inputs. The compared model inputs consisted of cgMLST and wgMLST distance matrices from 717 human and 717 animal isolates from cattle, chickens, dogs, ducks, pigs and turkeys. SNP distance matrices from 720 human and 720 animal isolates were also used. The data were collected from 2015 to 2017 in Denmark, with the animal sources consisting of domestic and imports from 7 European countries. Clusters consisted of network nodes representing respective genomes and links representing distances between genomes. Based on the results, animal sources were the main driving factor for cluster formation, followed by type of species and sampling year. The coherence source clustering (CSC) values based on animal sources were
【 授权许可】
Unknown