Conference Paper Details
20th International Conference on Computing in High Energy and Nuclear Physics
System performance monitoring of the ALICE Data Acquisition System with Zabbix
Physics; Computer Science
Telesca, A.^1 ; Carena, F.^1 ; Carena, W.^1 ; Chapeland, S.^1 ; Barroso, V. Chibante^1 ; Costa, F.^1 ; Dénes, E.^2 ; Divià, R.^1 ; Fuchs, U.^1 ; Grigore, A.^1,3 ; Ionita, C.^1 ; Delort, C.^1 ; Simonetti, G.^1,4 ; Soós, C.^1 ; Vyvre, P. Vande^1 ; Haller, B. Von^1
European Organization for Nuclear Research (CERN), Geneva, Switzerland^1
KFKI Research Institute for Particle and Nuclear Physics, Wigner Research Center, Budapest, Hungary^2
Politehnica University of Bucharest, Bucharest, Romania^3
Dipartimento Interateneo di Fisica M. Merlin, Bari, Italy^4
Keywords: Data acquisition system; Data collection method; Heavy-ion detectors; Large Hadron Collider; Large ion collider experiments; Monitoring information; Performance monitoring; Selection criteria
Others  :  https://iopscience.iop.org/article/10.1088/1742-6596/513/6/062046/pdf
DOI  :  10.1088/1742-6596/513/6/062046
Subject classification: Computer Science (General)
Source: IOP
【 Abstract 】

ALICE (A Large Ion Collider Experiment) is a heavy-ion detector studying the physics of strongly interacting matter and the quark-gluon plasma at the CERN LHC (Large Hadron Collider). The ALICE Data-AcQuisition (DAQ) system handles the data flow from the sub-detector electronics to the permanent data storage in the CERN computing center. The DAQ farm consists of about 1000 devices of many different types, ranging from directly accessible machines to storage arrays and custom optical links. The system performance monitoring tool used during LHC Run 1 will be replaced by a new tool for Run 2. This paper presents the results of an evaluation of six publicly available monitoring tools. The evaluation took into account selection criteria such as scalability, flexibility, and reliability, as well as data collection methods and display. All the tools were prototyped and evaluated according to those criteria. We describe the considerations that led to the selection of the Zabbix monitoring tool for the DAQ farm and present the results of the tests conducted in the ALICE DAQ laboratory. In addition, we describe the deployment of the software on the DAQ machines in terms of the metrics collected and the data collection methods used. We illustrate how remote nodes are monitored with Zabbix using SNMP-based agents and how DAQ-specific metrics are retrieved and displayed. We also show how the monitoring information is accessed and made available via the graphical user interface, and how Zabbix communicates with the other DAQ online systems for notification and reporting.
