会议论文详细信息
The Twelfth Workshop on Algorithm Engineering and Experiments
Untangling the Braid: Finding Outliers in a Set of Streams∗
Chiranjeeb Buragohain† Luca Foschini‡ Subhash Suri§
Others  :  http://www.siam.org/proceedings/alenex/2010/alx10_015_buragohainc.pdf
PID  :  38248
来源: CEUR
PDF
【 摘 要 】
Monitoring the performance of large shared computing systems such as the cloud computing infrastructure raises many challenging algorithmic problems. One common problem is to track users with the largest deviation from the norm (outliers), for some measure of performance. Taking a stream- computing perspective, we can think of each user’s performance profile as a stream of numbers (such as response times), and the aggregate performance profile of the shared infrastructure as a “braid” of these intermixed streams. The monitoring system’s goal then is to untangle this braid sufficiently to track the top k outliers. This paper investigates the space complexity of one-pass algorithms for ap- proximating outliers of this kind, proves lower bounds using multi-party communication complexity, and proposes small- memory heuristic algorithms. On one hand, stream outliers are easily tracked for simple measures, such as max or min, but our theoretical results rule out even good approximations for most of the natural measures such as average, median, or the quantiles. On the other hand, we show through simulation that our proposed heuristics perform quite well for a variety of synthetic data.
【 预 览 】
附件列表
Files Size Format View
Untangling the Braid: Finding Outliers in a Set of Streams∗ 423KB PDF download
  文献评价指标  
  下载次数:6次 浏览次数:5次