Journal Article Details
Frontiers in Environmental Science
Scientific Data Management in the Age of Big Data: An Approach Supporting a Resilience Index Development Effort
Deborah N. Vivian [1], Linda C. Harwell [1], Michelle D. McLaughlin [2], Stephen F. Hafner [3]
[1] National Health and Environmental Effects Research Laboratory, Gulf Ecology Division, Office of Research and Development, U.S. Environmental Protection Agency, Gulf Breeze, FL, United States
[2] Student Services Contractor, Oak Ridge Associated Universities, Oak Ridge, TN, United States
[3] Student Services Contractor, University of West Florida, Pensacola, FL, United States
Keywords: resilience; indicators; data management; framework; curation
DOI: 10.3389/fenvs.2019.00072
Source: DOAJ
【 Abstract 】

The increased availability of publicly available data is, in many ways, changing our approach to conducting research. Not only are cloud-based information resources providing supplementary data to bolster traditional scientific activities (e.g., field studies, laboratory experiments), they also serve as the foundation for secondary data research projects such as indicator development. Indicators and indices are a convenient way to synthesize disparate information to address complex scientific questions that are difficult to measure directly (e.g., resilience, sustainability, well-being). In the current literature, there is no shortage of indicator or index examples derived from secondary data with a growing number that are scientifically focused. However, little information is provided describing the management approaches and best practices used to govern the data underpinnings supporting these efforts. From acquisition to storage and maintenance, secondary data research products rely on the availability of relevant, high-quality data, repeatable data handling methods and a multi-faceted data flow process to promote and sustain research transparency and integrity. The U.S. Environmental Protection Agency recently published a report describing the development of a climate resilience screening index which used over one million data points to calculate the final index. The pool of data was derived exclusively from secondary sources such as the U.S. Census Bureau, Bureau of Labor Statistics, Postal Service, Housing and Urban Development, Forestry Services and others. Available data were presented in various forms including portable document format (PDF), delimited ASCII and proprietary format (e.g., Microsoft Excel, ESRI ArcGIS). The strategy employed for managing these data in an indicator research and development effort represented a blend of business practices, information science, and the scientific method. 
This paper describes the approach, highlighting key points unique to managing the data assets of a small-scale research project in an era of “big data.”

【 License 】

Unknown
