会议论文详细信息
6th Digital Earth Summit
Comparing NetCDF and SciDB on managing and querying 5D hydrologic dataset
地球科学;计算机科学
Liu, Haicheng^1 ; Xiao, Xiao^1
Changjiang River Scientific Research Institute, Changjiang River Water Resource Commission, Wuhan, China^1
关键词: Dimension orderings;    Extracting information;    High-dimensional;    Hydrologic modelling;    Meteorological modelling;    Multidimensional arrays;    Query performance;    Storage structures;   
Others  :  https://iopscience.iop.org/article/10.1088/1755-1315/46/1/012031/pdf
DOI  :  10.1088/1755-1315/46/1/012031
学科分类:计算机科学(综合)
来源: IOP
PDF
【 摘 要 】
Efficiently extracting information from high dimensional hydro-meteorological modelling datasets requires smart solutions. Traditional methods are mostly based on files, which can be edited and accessed handily. But they have problems of efficiency due to contiguous storage structure. Others propose databases as an alternative for advantages such as native functionalities for manipulating multidimensional (MD) arrays, smart caching strategy and scalability. In this research, NetCDF file based solutions and the multidimensional array database management system (DBMS) SciDB applying chunked storage structure are benchmarked to determine the best solution for storing and querying 5D large hydrologic modelling dataset. The effect of data storage configurations including chunk size, dimension order and compression on query performance is explored. Results indicate that dimension order to organize storage of 5D data has significant influence on query performance if chunk size is very large. But the effect becomes insignificant when chunk size is properly set. Compression of SciDB mostly has negative influence on query performance. Caching is an advantage but may be influenced by execution of different query processes. On the whole, NetCDF solution without compression is in general more efficient than the SciDB DBMS.
【 预 览 】
附件列表
Files Size Format View
Comparing NetCDF and SciDB on managing and querying 5D hydrologic dataset 1123KB PDF download
  文献评价指标  
  下载次数:9次 浏览次数:20次