Detecting Abnormal Machine Characteristics in Cloud Infrastructures
In the cloud computing environment, resources are accessed as services rather than as a product. Monitoring this system for performance is crucial because of the typical pay-per-use packages bought by users for their jobs. With the huge number of machines currently in the cloud system, it is often extremely difficult for system administrators to keep track of all machines using distributed monitoring programs such as Ganglia, which lacks system health assessment and summarization capabilities. To overcome this problem, we propose a technique for automated anomaly detection using machine performance data in the cloud. Our algorithm is entirely distributed and runs locally on each computing machine in the cloud, ranking the machines by how anomalously they behave for given jobs. There is no need to centralize any of the performance data for the analysis, and at the end of the analysis our algorithm generates error reports, allowing system administrators to take corrective action. Experiments performed on real data sets collected for different jobs show that our algorithm tracks anomalous machines in a cloud infrastructure with low overhead.
Document ID
20120000081
Acquisition Source
Ames Research Center
Document Type
Conference Paper
Authors
Bhaduri, Kanishka (MCT, Inc. Moffett Field, CA, United States)
Das, Kamalika (SGT, Inc. Moffett Field, CA, United States)
Matthews, Bryan L. (SGT, Inc. Moffett Field, CA, United States)
Date Acquired
August 25, 2013
Publication Date
December 10, 2011
Subject Category
Mathematical And Computer Sciences (General)
Report/Patent Number
ARC-E-DAA-TN4268
Meeting Information
Meeting: IEEE International Conference on Data Mining