科技报告详细信息
Overcoming Pitfalls When Using HDF5 Compression
Knox, Larry ; Pourmal, Elana
关键词: COMPUTER PROGRAMS;    DATA COMPRESSION;    MAINTENANCE;    COMPUTATION;    METADATA;    OPEN SOURCE LICENSING (COMPUTERS);    EOS DATA AND INFORMATION SYSTEM;    ALGORITHMS;   
RP-ID  :  GSFC-E-DAA-TN63478
学科分类:地球科学(综合)
美国|英语
来源: NASA Technical Reports Server
PDF
【 摘 要 】

Compression of large datasets in S-NPP, JPSS and other HDF5 data files may substantially reduce their size, in turn reducing disk space requirements and file download time. However, mismatches between the layout of the files' datasets, the HDF5 instance's cache settings, choice of compression algorithm, and the access pattern of applications using the data can sometimes result in poor performance or exhausting machine resources when running the application. Whether designed in advance or modified in response to encountered problems, applications can be tuned to optimize efficiency of data access, avoid unnecessary repeated decompression, and reduce the amount of memory used. Examples will be given of problems that may be encountered, how to use available tools to diagnose or work around them, changing cache settings to conserve memory, and designing access strategies to avoid both performance and memory issues when creating or modifying applications.

【 预 览 】
附件列表
Files Size Format View
20180008456.pdf 799KB PDF download
  文献评价指标  
  下载次数:11次 浏览次数:9次