Biodiversity Information Science and Standards
SpOccSum: An easy-to-use Python tool to summarize species occurrence data from material examined lists in taxonomic revisions
article
Michael Trizna [1], Torsten Dikow [2]
[1] Data Science Lab, Office of the Chief Information Officer, Smithsonian Institution; [2] Department of Entomology, National Museum of Natural History, Smithsonian Institution
Keywords: biodiversity data; species occurrence; seasonal incidence; Python; Jupyter Notebook
DOI: 10.3897/biss.3.36513
Source: Pensoft
【 Abstract 】

Taxonomic revisions contain crucial biodiversity data in the material examined sections for each species. In entomology, material examined lists minimally include the collecting locality, date of collection, and the number of specimens of each collection event. Insect species might be represented in taxonomic revisions by only a single specimen or by hundreds to thousands of specimens. Furthermore, revisions of insect genera might treat small genera with few species or include tens to hundreds of species. Summarizing data from such large and complex material examined lists and revisions is cumbersome, time-consuming, and prone to errors. However, providing data on the seasonal incidence, abundance, and collecting period of species is an important way to mobilize primary biodiversity data to understand a species' occurrence or rarity. Here, we present SpOccSum (Species Occurrence Summary), a tool to easily obtain metrics of seasonal incidence from specimen occurrence data in taxonomic revisions. SpOccSum is written in Python (Python Software Foundation 2019) and accessible through the Anaconda Python/R Data Science Platform as a Jupyter Notebook (Kluyver et al. 2016). The tool takes a simple list of specimen data in CSV format containing species name, locality, date of collection (preferably separated by day, month, and year), and number of specimens, and generates a series of tables and graphs summarizing: number of specimens per species, number of specimens collected per month, number of unique collection events, and the earliest and most recent collecting year of each species. The results can be exported as graphics or as CSV-formatted tables and can easily be included in manuscripts for publication. An example of an early version of the summary produced by SpOccSum can be viewed in Tables 1 and 2 of Markee and Dikow (2018). To accommodate seasonality in the Northern and Southern Hemispheres, users can choose to start the data display with either January or July.
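The summary metrics listed above can be illustrated with a minimal sketch using only the Python standard library. This is not SpOccSum's actual code: the column names (`species`, `locality`, `day`, `month`, `year`, `count`), the sample rows, the function names, and the definition of a collection event as a unique combination of locality and date are all assumptions made for illustration.

```python
import csv
import io

# Hypothetical input; the abstract does not specify SpOccSum's column headers.
SAMPLE_CSV = """species,locality,day,month,year,count
Aus bus,Site A,12,1,1998,3
Aus bus,Site A,12,1,1998,2
Aus bus,Site B,5,7,2004,1
Aus cus,Site C,20,11,1975,4
"""


def month_order(start_month=1):
    """Return month numbers 1-12 rotated to begin at start_month (1 = January, 7 = July)."""
    return [(start_month - 1 + i) % 12 + 1 for i in range(12)]


def summarize(rows, start_month=1):
    """Accumulate per-species metrics from an iterable of row dictionaries."""
    order = month_order(start_month)
    summary = {}
    for row in rows:
        count = int(row["count"])
        month, year = int(row["month"]), int(row["year"])
        entry = summary.setdefault(row["species"], {
            "specimens": 0,
            "per_month": dict.fromkeys(order, 0),  # keys kept in display order
            "events": set(),
            "earliest_year": year,
            "latest_year": year,
        })
        entry["specimens"] += count
        entry["per_month"][month] += count
        # Assumption: one collection event = one unique locality + date.
        entry["events"].add((row["locality"], row["day"], month, year))
        entry["earliest_year"] = min(entry["earliest_year"], year)
        entry["latest_year"] = max(entry["latest_year"], year)
    return summary


if __name__ == "__main__":
    result = summarize(csv.DictReader(io.StringIO(SAMPLE_CSV)), start_month=7)
    for species, s in result.items():
        print(species, s["specimens"], len(s["events"]),
              s["earliest_year"], s["latest_year"], s["per_month"])
```

Passing `start_month=7` rotates the monthly columns so that the display begins in July, which mirrors the January-or-July choice described for Southern Hemisphere seasonality.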
When geographic coordinates are available and species have widespread distributions spanning, for example, the equator, the user can itemize particular regions such as north of the Tropic of Cancer (23.5°N), Tropic of Cancer to the Equator, Equator to the Tropic of Capricorn, and south of the Tropic of Capricorn (23.5°S). Other features currently in development include the ability to produce distribution maps from the provided data (when geographic coordinates are included) and the option to export specimen occurrence data as a Darwin Core Archive ready for upload to the Global Biodiversity Information Facility (GBIF).
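The regional breakdown by latitude could be sketched as follows. The function name, the region labels as strings, and the treatment of records lying exactly on a boundary (23.5° or 0°) are assumptions; the abstract does not say how SpOccSum handles those cases.

```python
TROPIC_LATITUDE = 23.5  # degrees, the value given in the abstract


def latitude_region(latitude):
    """Assign a decimal-degree latitude (north positive) to one of four bands."""
    if not -90.0 <= latitude <= 90.0:
        raise ValueError(f"latitude out of range: {latitude}")
    if latitude > TROPIC_LATITUDE:
        return "North of Tropic of Cancer"
    if latitude >= 0:
        # Assumption: the Tropic of Cancer itself and the Equator fall here.
        return "Tropic of Cancer to Equator"
    if latitude >= -TROPIC_LATITUDE:
        # Assumption: the Tropic of Capricorn itself falls here.
        return "Equator to Tropic of Capricorn"
    return "South of Tropic of Capricorn"


if __name__ == "__main__":
    for lat in (38.9, 10.0, -10.0, -33.9):
        print(lat, latitude_region(lat))
```

Grouping records by the returned label before computing monthly counts would give one seasonal-incidence table per region for a species that spans the equator.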

【 License 】

Unknown   
