会议论文详细信息
21st International Conference on Computing in High Energy and Nuclear Physics
AsyncStageOut: Distributed user data management for CMS Analysis
物理学;计算机科学
Riahi, H.^1 ; Wildish, T.^2 ; Ciangottini, D.^3 ; Hernández, J.M.^4 ; Andreeva, J.^1 ; Balcas, J.^5 ; Karavakis, E.^1 ; Mascheroni, M.^6 ; Tanasijczuk, A.J.^7 ; Vaandering, E.W.^8
European Organization for Nuclear Research, IT Department, Geneva
CH-1211-23, Switzerland^1
Princeton University, Princeton
NJ
08544, United States^2
Universitá and INFN Perugia, Via Alessandro Pascoli, Perugia
06123, Italy^3
CIEMAT, Madrid
28040, Spain^4
DiSCC, Vilnius University, Vilnius
LT-01513, Lithuania^5
INFN Milano-Bicocca, Piazza della Scienza, Milan 3
I-20126, Italy^6
University of California, San Diego
CA
92093-0354, United States^7
Fermi National Laboratory, Batavia
IL
60510, United States^8
关键词: Data monitoring;    Deployment models;    Distributed data analysis;    High availability;    Issues and challenges;    New technologies;    Storage elements;    System scalability;   
Others  :  https://iopscience.iop.org/article/10.1088/1742-6596/664/6/062052/pdf
DOI  :  10.1088/1742-6596/664/6/062052
学科分类:计算机科学(综合)
来源: IOP
PDF
【 摘 要 】

AsyncStageOut (ASO) is a new component of the distributed data analysis system of CMS, CRAB, designed for managing users' data. It addresses a major weakness of the previous model, namely that mass storage of output data was part of the job execution resulting in inefficient use of job slots and an unacceptable failure rate at the end of the jobs. ASO foresees the management of up to 400k files per day of various sizes, spread worldwide across more than 60 sites. It must handle up to 1000 individual users per month, and work with minimal delay. This creates challenging requirements for system scalability, performance and monitoring. ASO uses FTS to schedule and execute the transfers between the storage elements of the source and destination sites. It has evolved from a limited prototype to a highly adaptable service, which manages and monitors the user file placement and bookkeeping. To ensure system scalability and data monitoring, it employs new technologies such as a NoSQL database and re-uses existing components of PhEDEx and the FTS Dashboard. We present the asynchronous stage-out strategy and the architecture of the solution we implemented to deal with those issues and challenges. The deployment model for the high availability and scalability of the service is discussed. The performance of the system during the commissioning and the first phase of production are also shown, along with results from simulations designed to explore the limits of scalability.

【 预 览 】
附件列表
Files Size Format View
AsyncStageOut: Distributed user data management for CMS Analysis 1717KB PDF download
  文献评价指标  
  下载次数:18次 浏览次数:43次