科技报告详细信息
Deadline-based Workload Management for MapReduce Environments: Pieces of the Perfromance
Verma, Abhishek ; Cherkasova, Ludmila ; Kumar, Vijay S. ; Campbell, Roy H.
HP Development Company
关键词: MapReduce;    Hadoop;    performance;    resource allocation;    job scheduling;   
RP-ID  :  HPL-2012-82
学科分类:计算机科学(综合)
美国|英语
来源: HP Labs
PDF
【 摘 要 】

Hadoop and the associated MapReduce paradigm have become the de facto platform for cost-effective analytics over "Big Data". There is an increasing number of MapReduce applications associated with live business intelligence that require completion time guarantees. In this work, we introduce and analyze a set of complementary mechanisms that enhance workload management decisions for processing MapReduce jobs with deadlines. The three mechanisms we consider are the following: 1) a policy for job ordering in the processing queue; 2) a mechanism for allocating a tailored number of map and reduce slots to each job with a completion time requirement; 3) a mechanism for allocating and deallocating (if necessary) spare resources in the system among the active jobs. We analyze the functionality and performance benefits of each mechanism via an extensive set of simulations over diverse workload sets. The proposed mechanisms form the integral pieces in the performance puzzle of automated workload management in MapReduce environments.

【 预 览 】
附件列表
Files Size Format View
RO201804100000351LZ 297KB PDF download
  文献评价指标  
  下载次数:27次 浏览次数:43次