NTRS - NASA Technical Reports Server

A real-time diagnostic and performance monitor for UNIX

There are now over one million UNIX sites, and the pace at which new installations are added is steadily increasing. Along with this increase comes a need to develop simple, efficient, effective, and adaptable ways of simultaneously collecting real-time diagnostic and performance data. This need exists because distributed systems can give rise to complex failure situations that are often unidentifiable with single-machine diagnostic software. The simultaneous collection of error and performance data is also important for research in failure prediction and error/performance studies. This paper introduces a portable method to concurrently collect real-time diagnostic and performance data on a distributed UNIX system. The combined diagnostic/performance data collection is implemented on a distributed multi-computer system using SUN4s as servers. The approach uses existing UNIX system facilities to gather system dependability information such as error and crash reports. In addition, performance data such as CPU utilization, disk usage, I/O transfer rate, and network contention are also collected. In the future, the collected data will be used to identify dependability bottlenecks and to analyze the impact of failures on system performance.
Document ID
19930002274
Acquisition Source
Legacy CDMS
Document Type
Thesis/Dissertation
Authors
Dong, Hongchao
(Illinois Univ. Urbana-Champaign, IL, United States)
Date Acquired
September 6, 2013
Publication Date
September 1, 1992
Subject Category
Computer Operations And Hardware
Report/Patent Number
UILU-ENG-92-2232
NASA-CR-190975
NAS 1.26:190975
CRHC-92-17
Accession Number
93N11462
Funding Number(s)
CONTRACT_GRANT: NAG1-613
Distribution Limits
Public
Copyright
Work of the US Gov. Public Use Permitted.