NASA Logo

NTRS

NTRS - NASA Technical Reports Server

Back to Results
Determining the Number of Clusters in a Data Set Without Graphical InterpretationCluster analysis is a data mining technique that is meant ot simplify the process of classifying data points. The basic clustering process requires an input of data points and the number of clusters wanted. The clustering algorithm will then pick starting C points for the clusters, which can be either random spatial points or random data points. It then assigns each data point to the nearest C point where "nearest usually means Euclidean distance, but some algorithms use another criterion. The next step is determining whether the clustering arrangement this found is within a certain tolerance. If it falls within this tolerance, the process ends. Otherwise the C points are adjusted based on how many data points are in each cluster, and the steps repeat until the algorithm converges,
Document ID
20110016534
Acquisition Source
Ames Research Center
Document Type
Presentation
Authors
Aguirre, Nathan S.
(Hispanic Coll. Fund, Inc. Washington, DC, United States)
Davies, Misty D.
(NASA Ames Research Center Moffett Field, CA, United States)
Date Acquired
August 25, 2013
Publication Date
August 6, 2011
Subject Category
Computer Systems
Report/Patent Number
ARC-E-DAA-TN3997
Report Number: ARC-E-DAA-TN3997
Funding Number(s)
CONTRACT_GRANT: NNX08AF59A
Distribution Limits
Public
Copyright
Public Use Permitted.
No Preview Available