NASA Logo

NTRS

NTRS - NASA Technical Reports Server

Back to Results
Highly parallel sparse Cholesky factorizationSeveral fine grained parallel algorithms were developed and compared to compute the Cholesky factorization of a sparse matrix. The experimental implementations are on the Connection Machine, a distributed memory SIMD machine whose programming model conceptually supplies one processor per data element. In contrast to special purpose algorithms in which the matrix structure conforms to the connection structure of the machine, the focus is on matrices with arbitrary sparsity structure. The most promising algorithm is one whose inner loop performs several dense factorizations simultaneously on a 2-D grid of processors. Virtually any massively parallel dense factorization algorithm can be used as the key subroutine. The sparse code attains execution rates comparable to those of the dense subroutine. Although at present architectural limitations prevent the dense factorization from realizing its potential efficiency, it is concluded that a regular data parallel architecture can be used efficiently to solve arbitrarily structured sparse problems. A performance model is also presented and it is used to analyze the algorithms.
Document ID
19910023533
Acquisition Source
Legacy CDMS
Document Type
Contractor Report (CR)
Authors
Gilbert, John R.
(Xerox Palo Alto Research Center CA., United States)
Schreiber, Robert
(Research Inst. for Advanced Computer Science Moffett Field, CA, United States)
Date Acquired
September 6, 2013
Publication Date
August 1, 1990
Subject Category
Computer Systems
Report/Patent Number
RIACS-TR-90-39
NASA-CR-188875
NAS 1.26:188875
Accession Number
91N32847
Funding Number(s)
CONTRACT_GRANT: NCC2-387
Distribution Limits
Public
Copyright
Work of the US Gov. Public Use Permitted.
No Preview Available