J. Dongarra et al. / Recursive approach in sparse matrix LU factorization
59
because the high average density of the blocks will not
be as crucial any more.
[5] C. Ashcraft, R. Grimes, J. Lewis, B. Peyton and H. Simon,
Progress in sparse matrix methods in large sparse linear sys-
tems on vector supercomputers, Intern. J. of Supercomputer
Applications 1 (1987), 10–30.
Another outstanding issue is the numerical stability
of the factorization process. As it is now, it does not
perform pivoting and still delivers acceptable accuracy.
On matrices other than those tested, the method may
still fail, and even iterative refinement may be unable
to regain sufficient accuracy. Therefore, an extended
version that performs at least some form of pivoting
would likely be much more robust.
[
6] D. Bailey, K. Lee and H. Simon, Using Strassen’s algorithm
to accelerate the solution of linear systems, The Journal of
Supercomputing 4 (1990), 357–371.
[
7] R. Barrett, M. Berry, J. Dongarra, V. Eijkhout and C. Romine,
Algorithmic bombardment for the iterative solution of linear
systems: a poly-iterative approach, JCAM 74 (1996), 91–109.
[8] E. Cuthill and J. McKee, Reducing the bandwidth of sparse
symmetric matrices, in: Proceedings of ACM National Con-
ference, Association of Computing Machinery, New York,
A parallel version of the recursive approach for
sparse matrices is also under consideration. At this
point, there are many issues to be resolved and the main
direction is still not clear. Supernodal and multifrontal
approaches use symbolical data structures from the se-
quential algorithm to assist the parallel implementa-
tion. In the recusive approach no such structures are
used and consequently parallelism has to be exploited
in some other way. On the other hand, dense codes [21,
1
969.
[
9] M. Dayde and I. Duff, Level 3 BLAS in LU factorization on
Cray-2, ETA-10P and IBM 3090-200/VF, The International
Jorunal of Supercomputer Applications 3 (1989), 40–70.
[
10] T. Davis, University of Florida Sparse Matrix Collection,
http://www.cise.ufl.edu/˜davis/sparse/,
ftp://ftp.cise.ufl.edu/pub/faculty/davis/matrices, NA Digest
94(42) (October 16, 1994), NA Digest 96(28) (July 23, 1996),
and NA Digest 97(23) (June 7, 1997).
[
11] T. Davis and I. Duff, An unsymmetric-pattern multifrontal
method for sparse LU factorization, Technical Report RAL-
93-036, Rutherford Appleton Laboratory, Chilton, Didcot,
Oxfordshire, 1994.
12] T. Davis, User’s guide for the unsymmetric-pattern multi-
frontal package (UMFPACK), Technical Report TR-93-020,
Computer and Information Sciences Department, University
of Florida, June 1993.
3
0] use recursion only locally and resort to other tech-
niques in order to expose parallelism inherent in the
factorization process [32].
[
Acknowledgments
[13] J. Demmel, S. Eisenstat, J. Gilbert, X. Li and J. Liu, A su-
pernodal approach to sparse partial pivoting, Technical report
UCB//CSD-95-883, Computer Science Division, U.C. Berke-
ley, Berkeley, California, 1995.
[14] J. Dongarra, J. Du Croz, I. Duff and S. Hammarling, A set of
Level 3 FORTRAN Basic Linear Algebra Subprograms, ACM
Transactions on Mathematical Software 16 (March 1990), 1–
This work was supported in part by the Universi-
ty of California Berkeley through subcontract num-
ber SA2283JB, as part of the prime contract ACI-
9
813362from the National Science Foundation; and by
17.
the University of California Berkeley through subcon-
tract number SA1248PG, as part of the prime contract
DEFG03-94ER25219 from the Department of Energy.
[
15] J. Dongarra, J. Du Croz, S. Hammarling and R. Hanson,
An extended set of FORTRAN Basic Linear Algebra Sub-
programs, ACM Transactions on Mathematical Software 14
(March 1988), 1–17.
[16] J. Dongarra and R. Whaley, Automatically Tuned Linear Al-
gebra Software (ATLAS), in: Proceedings of SC’89, 1989.
[17] I. Duff, A. Erisman and J. Reid, Direct methods for sparse
matrices, Oxford University Press, 1989.
References
[
18] I. Duff and J. Koster, The design and use of algorithms for
permuting large entries to the diagonal of sparse matrices,
SIAM J. Matrix Anal. Appl. 20 (1999), 889–901.
19] I. Duff, R. Grimes and J. Lewis, Sparse matrix test problems,
ACM Transactions on Mathematical Software 15 (1989), 1–
14.
[
1] R. Amestoy, T. Davis and I. Duff, An approximate minimum
degree algorithm, Technical Report TR/PA/95/09, CERFACS,
Toulouse, France.
[
[
2] E. Anderson, Z. Bai, C. Bischof, S. Blackford, J. Demmel,
J. Dongarra, J. Du Croz, A. Greenbaum, S. Hammarling,
A. McKenney and D. Sorensen, LAPACK User’s Guide, Soci-
ety for Industrial and Applied Mathematics, Philadelphia, PA,
Third edition, 1999.
[20] I. Duff and J. Reid, The multifrontal solution of indefinite
sparse symmetric linear equations, ACM Transactions on
Mathematical Software 9(3) (September 1983), 302–325.
[
3] B. Andersen, F. Gustavson and J. Wasniewski, A recursive
formulation of the Cholesky factorization operating on a ma-
trix in packed storage form, in Proceedings of the 9th SIAM
Conference on Parallel Processing for Scientific Computing,
San Antonio, TX, USA, March 24–27, 1999.
[
21] E. Elmroth and F. Gustavson, Applying recursion to serial and
parallel QR factorization leads to better performance, IBM
Journal of Research and Development 44(4) (2000), 605–624.
[22] A. George, Nested dissection of a regular finite element mesh,
SIAM Journal of Numerical Analysis 10 (1973), 345–363.
[23] N.E. Gibbs, W.G. Poole and P.K. Stockmeyer, An algorithm
for reducing the bandwidth and profile of a sparse matrix,
SIAM Journal of Numerical Analysis 13(2) (April 1976).
[
4] B. Andersen, F. Gustavson, A. Karaivanov, J. Wasniewski and
P. Yalamov, LAWRA – linear algebra with recursive algo-
rithms, in: Proceedings of the Conference on Parallel Pro-
cessing and Applied Mathematics, Kazimierz Dolny, Poland,
September 14–17, 1999.