NASA Logo

NTRS

NTRS - NASA Technical Reports Server

Back to Results
A Local Scalable Distributed Expectation Maximization Algorithm for Large Peer-to-Peer NetworksThis paper offers a local distributed algorithm for expectation maximization in large peer-to-peer environments. The algorithm can be used for a variety of well-known data mining tasks in a distributed environment such as clustering, anomaly detection, target tracking to name a few. This technology is crucial for many emerging peer-to-peer applications for bioinformatics, astronomy, social networking, sensor networks and web mining. Centralizing all or some of the data for building global models is impractical in such peer-to-peer environments because of the large number of data sources, the asynchronous nature of the peer-to-peer networks, and dynamic nature of the data/network. The distributed algorithm we have developed in this paper is provably-correct i.e. it converges to the same result compared to a similar centralized algorithm and can automatically adapt to changes to the data and the network. We show that the communication overhead of the algorithm is very low due to its local nature. This monitoring algorithm is then used as a feedback loop to sample data from the network and rebuild the model when it is outdated. We present thorough experimental results to verify our theoretical claims.
Document ID
20090037063
Acquisition Source
Ames Research Center
Document Type
Conference Paper
Authors
Bhaduri, Kanishka
(Mission Critical Technologies, Inc. Moffett Field, CA, United States)
Srivastava, Ashok N.
(NASA Ames Research Center Moffett Field, CA, United States)
Date Acquired
August 24, 2013
Publication Date
June 28, 2009
Subject Category
Computer Programming And Software
Report/Patent Number
ARC-E-DAA-TN398
Report Number: ARC-E-DAA-TN398
Meeting Information
Meeting: Knowledge Discovery and Data Mining
Location: Paris
Country: France
Start Date: June 28, 2009
End Date: July 1, 2009
Funding Number(s)
CONTRACT_GRANT: NNA04AA18B
Distribution Limits
Public
Copyright
Public Use Permitted.
No Preview Available