NASA Logo

NTRS

NTRS - NASA Technical Reports Server

Due to the lapse in federal government funding, NASA is not updating this website. We sincerely regret this inconvenience.

Back to Results
Multivariate Statistical Analysis Software Technologies for Astrophysical Research Involving Large Data BasesWe developed a package to process and analyze the data from the digital version of the Second Palomar Sky Survey. This system, called SKICAT, incorporates the latest in machine learning and expert systems software technology, in order to classify the detected objects objectively and uniformly, and facilitate handling of the enormous data sets from digital sky surveys and other sources. The system provides a powerful, integrated environment for the manipulation and scientific investigation of catalogs from virtually any source. It serves three principal functions: image catalog construction, catalog management, and catalog analysis. Through use of the GID3* Decision Tree artificial induction software, SKICAT automates the process of classifying objects within CCD and digitized plate images. To exploit these catalogs, the system also provides tools to merge them into a large, complex database which may be easily queried and modified when new data or better methods of calibrating or classifying become available. The most innovative feature of SKICAT is the facility it provides to experiment with and apply the latest in machine learning technology to the tasks of catalog construction and analysis. SKICAT provides a unique environment for implementing these tools for any number of future scientific purposes. Initial scientific verification and performance tests have been made using galaxy counts and measurements of galaxy clustering from small subsets of the survey data, and a search for very high redshift quasars. All of the tests were successful and produced new and interesting scientific results. Attachments to this report give detailed accounts of the technical aspects of the SKICAT system, and of some of the scientific results achieved to date. We also developed a user-friendly package for multivariate statistical analysis of small and moderate-size data sets, called STATPROG. The package was tested extensively on a number of real scientific applications and has produced real, published results.
Document ID
19950011506
Acquisition Source
Legacy CDMS
Document Type
Contractor Report (CR)
Authors
Djorgovski, S. G.
(California Inst. of Tech. Pasadena, CA, United States)
Date Acquired
September 6, 2013
Publication Date
November 12, 1994
Subject Category
Cybernetics
Report/Patent Number
NAS 1.26:197499
NASA-CR-197499
Report Number: NAS 1.26:197499
Report Number: NASA-CR-197499
Accession Number
95N17921
Funding Number(s)
PROJECT: RTOP 656-65-07
CONTRACT_GRANT: NAS5-31348
CONTRACT_GRANT: USRA-C-5555-32
CONTRACT_GRANT: NAS5-32337
Distribution Limits
Public
Copyright
Work of the US Gov. Public Use Permitted.
No Preview Available