会议论文详细信息
17th International Workshop on Advanced Computing and Analysis Techniques in Physics Research
Data Mining as a Service (DMaaS)
物理学;计算机科学
Tejedor, E.^1 ; Piparo, D.^1 ; Mascetti, L.^1 ; Moscicki, J.^1 ; Lamanna, M.^1 ; Mato, P.^1
CERN, Geneva 23
CH-1211, Switzerland^1
关键词: Analysis frameworks;    Computing infrastructures;    Fully integrated;    Interactive mining;    Massive storages;    Scientific softwares;    User authentication;    Virtual computing;   
Others  :  https://iopscience.iop.org/article/10.1088/1742-6596/762/1/012039/pdf
DOI  :  10.1088/1742-6596/762/1/012039
学科分类:计算机科学(综合)
来源: IOP
PDF
【 摘 要 】
Data Mining as a Service (DMaaS) is a software and computing infrastructure that allows interactive mining of scientific data in the cloud. It allows users to run advanced data analyses by leveraging the widely adopted Jupyter notebook interface. Furthermore, the system makes it easier to share results and scientific code, access scientific software, produce tutorials and demonstrations as well as preserve the analyses of scientists. This paper describes how a first pilot of the DMaaS service is being deployed at CERN, starting from the notebook interface that has been fully integrated with the ROOT analysis framework, in order to provide all the tools for scientists to run their analyses. Additionally, we characterise the service backend, which combines a set of IT services such as user authentication, virtual computing infrastructure, mass storage, file synchronisation, development portals or batch systems. The added value acquired by the combination of the aforementioned categories of services is discussed, focusing on the opportunities offered by the CERNBox synchronisation service and its massive storage backend, EOS.
【 预 览 】
附件列表
Files Size Format View
Data Mining as a Service (DMaaS) 1256KB PDF download
  文献评价指标  
  下载次数:18次 浏览次数:23次