NASA Logo

NTRS

NTRS - NASA Technical Reports Server

Back to Results
Automated clustering-based workload characterizationThe demands placed on the mass storage systems at various federal agencies and national laboratories are continuously increasing in intensity. This forces system managers to constantly monitor the system, evaluate the demand placed on it, and tune it appropriately using either heuristics based on experience or analytic models. Performance models require an accurate workload characterization. This can be a laborious and time consuming process. It became evident from our experience that a tool is necessary to automate the workload characterization process. This paper presents the design and discusses the implementation of a tool for workload characterization of mass storage systems. The main features of the tool discussed here are: (1)Automatic support for peak-period determination. Histograms of system activity are generated and presented to the user for peak-period determination; (2) Automatic clustering analysis. The data collected from the mass storage system logs is clustered using clustering algorithms and tightness measures to limit the number of generated clusters; (3) Reporting of varied file statistics. The tool computes several statistics on file sizes such as average, standard deviation, minimum, maximum, frequency, as well as average transfer time. These statistics are given on a per cluster basis; (4) Portability. The tool can easily be used to characterize the workload in mass storage systems of different vendors. The user needs to specify through a simple log description language how the a specific log should be interpreted. The rest of this paper is organized as follows. Section two presents basic concepts in workload characterization as they apply to mass storage systems. Section three describes clustering algorithms and tightness measures. The following section presents the architecture of the tool. Section five presents some results of workload characterization using the tool.Finally, section six presents some concluding remarks.
Document ID
19960051341
Acquisition Source
Goddard Space Flight Center
Document Type
Conference Paper
Authors
Pentakalos, Odysseas I.
(NASA Goddard Space Flight Center Greenbelt, MD United States)
Menasce, Daniel A.
(George Mason Univ. Fairfax, VA United States)
Yesha, Yelena
(Maryland Univ. Baltimore County Catonsville, MD United States)
Date Acquired
September 6, 2013
Publication Date
September 1, 1996
Publication Information
Publication: Fifth NASA Goddard Conference on Mass Storage Systems and Technologies.
Volume: 1
Subject Category
Documentation And Information Science
Accession Number
96N34991
Distribution Limits
Public
Copyright
Work of the US Gov. Public Use Permitted.
No Preview Available