科技报告详细信息
Log Summarization and Anomaly Detection for TroubleshootingDistributed Systems.
Gunter, D. ; Tierney, B. L. ; Brown, A. ; Swany, M. ; Bresnahan, J.
Technical Information Center Oak Ridge Tennessee
关键词: Disstributed data processing;    Algorithms;    Detection;    Data base management;    Data analysis;   
RP-ID  :  DE2008932522
学科分类:工程和技术(综合)
美国|英语
来源: National Technical Reports Library
PDF
【 摘 要 】

Today's system monitoring tools are capable of detecting system failures such as host failures, OS errors, and network partitions in near-real time. Unfortunately, the same cannot yet be said of the end-to-end distributed software stack. Any given action, for example, reliably transferring a directory of files, can involve a wide range of complex and interrelated actions across multiple pieces of software: checking user certificates and permissions, getting details for all files, performing third-party transfers, understanding re-try policy decisions, etc. We present an infrastructure for troubleshooting complex middleware, a general purpose technique for configurable log summarization, and an anomaly detection technique that works in near-real time on running Grid middleware. We present results gathered using this infrastructure from instrumented Grid middleware and applications running on the Emulab testbed. From these results, we analyze the effectiveness of several algorithms at accurately detecting a variety of performance anomalies.

【 预 览 】
附件列表
Files Size Format View
DE2008932522.pdf 240KB PDF download
  文献评价指标  
  下载次数:17次 浏览次数:11次