NASA Logo

NTRS

NTRS - NASA Technical Reports Server

Back to Results
A balanced submatrix merging algorithm for multiprocessor architecturesIn this article, a parallel algorithm which applies Givens rotations to selectively annihilate k(k + 1)/2 nonzero elements from two k x n(k not more than n) upper trapezoidal submatrices is described. The new algorithm is suitable for implementation on either a pair of directly connected local-memory processors or two clusters of multiple tightly-coupled processors. Analyses show that in both cases the proposed algorithms achieve optimal speed-up by balancing the work load distribution and masking interprocessor or intercluster communication by computation if k is much small than n. In the context of solving large scale least squares problems, this submatrix merging step is repetitively needed during the entire computation and, furthermore, there are usually many pairs of such submatrices to be merged with each submatrix stored in the memory of a processor or a cluster of processors. The proposed algorithm can be applied to each pair of submatrices concurrently, and thus parallelizes an important step in solving the least squares problems.
Document ID
19920041868
Acquisition Source
Legacy CDMS
Document Type
Reprint (Version printed in journal)
Authors
Chu, Eleanor
(Guelph, University Canada)
George, Alan
(Waterloo, University Canada)
Date Acquired
August 15, 2013
Publication Date
January 1, 1992
Publication Information
Publication: Parallel Computing
Volume: 18
ISSN: 0167-8191
Subject Category
Computer Programming And Software
Accession Number
92A24492
Funding Number(s)
CONTRACT_GRANT: NSERC-OGP-0008111
CONTRACT_GRANT: DE-AC05-84OR-21400
CONTRACT_GRANT: NAGW-1457
Distribution Limits
Public
Copyright
Other

Available Downloads

There are no available downloads for this record.
No Preview Available