科技报告详细信息
Globally Distributed BookPrep
Reddy, Prakash ; Dudekula, Shariff ; Puthanveedu, Susanth ; Milojicic, Dejan
HP Development Company
关键词: No keywords available;   
RP-ID  :  HPL-2011-133
学科分类:计算机科学(综合)
美国|英语
来源: HP Labs
PDF
【 摘 要 】

BookPrep is a Print-On-Demand service that takes raw scans and converts them to print-ready files. It requires large amount of storage and takes an average of 5 hours of CPU time to process a single book with about 300 pages. The experiment we conducted is processing of books on Open Cirrus where the data is close to compute servers. At three Open Cirrus sites we installed BookPrep service and we pre-populated each site with region-specific scanned books. When request comes in to process the book, it is routed to the compute node closest to the source data. The compute node is then expected to store the processed data on the same network. The compute nodes are allocated and de-allocated based on demand. There is a cloud based metadata repository that is used to update the metadata associated with each book regardless of where the source and derived data is stored. The goal of this experiment is to determine if performance can be improved if the compute is moved close to data and we would like to see if that same principal can be applied to pull based scheduling model.

【 预 览 】
附件列表
Files Size Format View
RO201804100002852LZ 492KB PDF download
  文献评价指标  
  下载次数:19次 浏览次数:19次