NASA Logo

NTRS

NTRS - NASA Technical Reports Server

Back to Results
Automating sky object classification in astronomical survey imagesWe describe the application of machine classification techniques to the development of an automated tool for the reduction of a large scientific data set. The 2nd Palomer Observatory Sky Survey is nearly completed. This survey provides comprehensive coverage of the northern celestial hemisphere in the form of photographic plates. The plates are being transformed into digitized images whose quality will probably not be surpassed in the next ten to twenty years. The images are expected to contain on the order of 10(exp 7) galaxies and 10(exp 8) stars. Astronomers wish to determine which of these sky objects belong to various classes of galaxies and stars. The size of this data set precludes manual analysis. Our approach is to develop a software system which integrates the functions of independently developed techniques for image processing and data classification. Digitized sky images are passed through image processing routines to identify sky objects and to extract a set of features for each object. These routines are used to help select a useful set of attributes for classifying sky objects. Then GID3* and O-BTree, two inductive learning techniques, learn classification decision trees from examples. These classifiers will be used to process the rest of the data. This paper gives an overview of the machine learning techniques used, describes the details of our specific application, and reports the initial encouraging results. The results indicate that our approach is well-suited to the problem. The primary benefits of the approach are increased data reduction throughput and consistency of classification. The classification rules which are the product of the inductive learning techniques will form an object, examinable basis for classifying sky objects. A final, not to be underestimated benefit is that astronomers will be freed from the tedium of an intensely visual task to pursue more challenging analysis and interpretation problems based on automatically cataloged data.
Document ID
19930010539
Acquisition Source
Legacy CDMS
Document Type
Conference Paper
Authors
Fayyad, Usama M.
(Jet Propulsion Lab., California Inst. of Tech. Pasadena, CA, United States)
Doyle, Richard J.
(Jet Propulsion Lab., California Inst. of Tech. Pasadena, CA, United States)
Weir, Nicholas
(California Inst. of Tech. Pasadena., United States)
Djorgovski, S. G.
(California Inst. of Tech. Pasadena., United States)
Date Acquired
August 15, 2013
Publication Date
July 1, 1992
Publication Information
Publication: Wichita State Univ., Proceedings of the ML-92 Workshop on Machine Discovery (MD-92)
Subject Category
Documentation And Information Science
Accession Number
93N19728
Distribution Limits
Public
Copyright
Other

Available Downloads

There are no available downloads for this record.
No Preview Available