科技报告详细信息
NetLogger: A toolkit for distributed system performance tuning anddeb ugging
Tierney, Brian ; Gunter, Dan
Lawrence Berkeley National Laboratory
关键词: Monitoring;    Distributed Systems Performance Analaysis;    Tuning Distributed Systems Performance Analaysis;    42;    Diagnosis;   
DOI  :  10.2172/924785
RP-ID  :  LBNL--51276
RP-ID  :  DE-AC02-05CH11231
RP-ID  :  924785
美国|英语
来源: UNT Digital Library
PDF
【 摘 要 】

Developers and users of high-performance distributed systemsoften observe performance problems such as unexpectedly low throughput orhigh latency. Determining the source of the performance problems requiresdetailed end-to-end instrumentation of all components, including theapplications, operating systems, hosts, and networks. In this paper wedescribe a methodology that enables the real-time diagnosis ofperformance problems in complex high-performance distributed systems. Themethodology includes tools for generating timestamped event logs that canbe used to provide detailed end-to-end application and system levelmonitoring; and tools for visualizing the log data and real-time state ofthe distributed system. This methodology, called NetLogger, has proveninvaluable for diagnosing problems in networks and in distributed systemscode. This approach is novel in that it combines network, host, andapplication-level monitoring, providing a complete view of the entiresystem. NetLogger is designed to be extremely light-weight, and includesa mechanism for reliably collecting monitoring events from multipledistributed locations. This technical report summarizes most importantpoints of several previous papers on NetLogger, and is meant to be usedas a general overview.

【 预 览 】
附件列表
Files Size Format View
924785.pdf 76KB PDF download
  文献评价指标  
  下载次数:11次 浏览次数:29次