会议论文详细信息
20th International Conference on Computing in High Energy and Nuclear Physics
ATLAS Distributed Computing Monitoring tools during the LHC Run I
物理学;计算机科学
Schovancová, J.^1 ; Campana, S.^2 ; Girolamo, A. Di^2 ; Jézéquel, S.^3 ; Ueda, I.^4 ; Wenaus, T.^1
Brookhaven National Laboratory, Physics Department, Upton
NY
11973, United States^1
CERN, Geneva 23
CH-1211, Switzerland^2
LAPP, Universite de Savoie, CNRS/IN2P3, Annecy-le-Vieux, France^3
University of Tokyo, International Center for Elementary Particle Physics, Department of Physics, 7-3-1 Hongo, Bunkyo-ku, JP-Tokyo
113-0033, Japan^4
关键词: Accounting informations;    Distributed computing resources;    Graphical elements;    Long-term measurements;    Monitoring applications;    Multiple data sources;    Real time monitoring;    Standardized interfaces;   
Others  :  https://iopscience.iop.org/article/10.1088/1742-6596/513/3/032084/pdf
DOI  :  10.1088/1742-6596/513/3/032084
学科分类:计算机科学(综合)
来源: IOP
PDF
【 摘 要 】

This contribution summarizes evolution of the ATLAS Distributed Computing (ADC) Monitoring project during the LHC Run I. The ADC Monitoring targets at the three groups of customers: ADC Operations team to early identify malfunctions and escalate issues to an activity or a service expert, ATLAS national contacts and sites for the real-time monitoring and long-term measurement of the performance of the provided computing resources, and the ATLAS Management for long-term trends and accounting information about the ATLAS Distributed Computing resources. During the LHC Run I a significant development effort has been invested in standardization of the monitoring and accounting applications in order to provide extensive monitoring and accounting suite. ADC Monitoring applications separate the data layer and the visualization layer. The data layer exposes data in a predefined format. The visualization layer is designed bearing in mind visual identity of the provided graphical elements, and re-usability of the visualization bits across the different tools. A rich family of various filtering and searching options enhancing available user interfaces comes naturally with the data and visualization layer separation. With a variety of reliable monitoring data accessible through standardized interfaces, the possibility of automating actions under well defined conditions correlating multiple data sources has become feasible. In this contribution we discuss also about the automated exclusion of degraded resources and their automated recovery in various activities.

【 预 览 】
附件列表
Files Size Format View
ATLAS Distributed Computing Monitoring tools during the LHC Run I 578KB PDF download
  文献评价指标  
  下载次数:10次 浏览次数:24次