Formal Linear Algebra Recovery Environment is a computer program for high-performance, fault-tolerant matrix multiplication. The program is based on an extension of the prior theory and practice of fault-tolerant matrix·matrix multiplication of the form C = AB. This extension provides low-overhead methods for detecting errors, not only in C, but also in A and/or B. These methods enable the detection of all errors as long as, in a given case, only one entry in A, B, or C is corrupted. The program also provides for following a low-overhead roll-back approach to correct errors once detected. Results of computational experiments have demonstrated that the methods implemented in this program work well in practice while imposing an acceptably low level of overhead, relative to high-performance matrix-multiplication methods that do not afford fault tolerance.

This program was written by Daniel Katz, Edwin Tisdale, Enrique Quintana-Ortí, John Gunnels, and Robert van de Geijn of Caltech for NASA's Jet Propulsion Laboratory.

This software is available for commercial licensing. Please contact Don Hart of the California Institute of Technology at (818) 393-3425. Refer to NPO-30395.



This Brief includes a Technical Support Package (TSP).
Document cover
Software for Fault-Tolerant Matrix Multiplication

(reference NPO30395) is currently available for download from the TSP library.

Don't have an account? Sign up here.