Final report for''automated diagnosis of large scale parallel applications'' | |
Karavanic, K L | |
Lawrence Livermore National Laboratory | |
关键词: Management; Evaluation; Diagnosis; Performance; 99 General And Miscellaneous//Mathematics, Computing, And Information Science; | |
DOI : 10.2172/15005433 RP-ID : UCRL-CR-142047 RP-ID : W-7405-ENG-48 RP-ID : 15005433 |
|
美国|英语 | |
来源: UNT Digital Library | |
【 摘 要 】
The work performed is part of a continuing research project, PPerfDB, headed by Dr. Karavanic. We are studying the application of experiment management techniques to the problems associated with gathering, storing, and using performance data with the goal of achieving completely automated diagnosis of application and system bottlenecks. This summer we focused on incorporating heterogeneous data from a variety of tools, applications, and platforms, and on designing novel techniques for automated performance diagnosis. The Experiment Management paradigm is a useful approach for designing a tool that will automatically diagnose performance problems in large-scale parallel applications. The ability to gather, store, and use performance data gathered over time from different executions and using different collection tools enables more sophisticated approaches to performance diagnosis and to performance evaluation more generally. We look forward to continuing our efforts by further development and analysis of online diagnosis using historical data, and by investigating performance data and diagnosis gathered from mixed MPUOpenMP applications.
【 预 览 】
Files | Size | Format | View |
---|---|---|---|
15005433.pdf | 198KB | download |