NetLogger: A toolkit for distributed system performance tuning anddeb ugging | |
Tierney, Brian ; Gunter, Dan | |
Lawrence Berkeley National Laboratory | |
关键词: Monitoring; Distributed Systems Performance Analaysis; Tuning Distributed Systems Performance Analaysis; 42; Diagnosis; | |
DOI : 10.2172/924785 RP-ID : LBNL--51276 RP-ID : DE-AC02-05CH11231 RP-ID : 924785 |
|
美国|英语 | |
来源: UNT Digital Library | |
【 摘 要 】
Developers and users of high-performance distributed systemsoften observe performance problems such as unexpectedly low throughput orhigh latency. Determining the source of the performance problems requiresdetailed end-to-end instrumentation of all components, including theapplications, operating systems, hosts, and networks. In this paper wedescribe a methodology that enables the real-time diagnosis ofperformance problems in complex high-performance distributed systems. Themethodology includes tools for generating timestamped event logs that canbe used to provide detailed end-to-end application and system levelmonitoring; and tools for visualizing the log data and real-time state ofthe distributed system. This methodology, called NetLogger, has proveninvaluable for diagnosing problems in networks and in distributed systemscode. This approach is novel in that it combines network, host, andapplication-level monitoring, providing a complete view of the entiresystem. NetLogger is designed to be extremely light-weight, and includesa mechanism for reliably collecting monitoring events from multipledistributed locations. This technical report summarizes most importantpoints of several previous papers on NetLogger, and is meant to be usedas a general overview.
【 预 览 】
Files | Size | Format | View |
---|---|---|---|
924785.pdf | 76KB | download |