A document presents some additional information on the subject matter of "Integrated Hardware and Software for No-Loss Computing" (NPO-42554), which appears elsewhere in this issue of NASA Tech Briefs. To recapitulate: The hardware and software designs of a developmental parallel computing system are integrated to effectuate a concept of no-loss computing (NLC). The system is designed to reconfigure an application program such that it can be monitored in real time and further reconfigured to continue a computation in the event of failure of one of the computers. The design provides for (1) a distributed class of NLC computation agents, denoted introspection agents, that effects hierarchical detection of anomalies; (2) enhancement of the compiler of the parallel computing system to cause generation of state vectors that can be used to continue a computation in the event of a failure; and (3) activation of a recovery component when an anomaly is detected.
This work was done by Mark James of Caltech for NASA's Jet Propulsion Laboratory. For further information, access the Technical Support Package (TSP) free online at www.techbriefs.com/tsp under the Information Sciences category.
The software used in this innovation is available for commercial licensing. Please contact Karina Edmonds of the California Institute of Technology at (626) 395-2322. Refer to NPO-42511
This Brief includes a Technical Support Package (TSP).

More About Software for No-Loss Computing
(reference NPO-42511) is currently available for download from the TSP library.
Don't have an account?
Overview
The document titled "More About Software for No-Loss Computing" from NASA's Jet Propulsion Laboratory discusses advancements in software technology aimed at enhancing the reliability and safety of critical flight systems. It focuses on the development of automatic real-time recovery mechanisms for flight software execution failures, which are essential for both manned and unmanned space missions.
The document outlines the challenges faced in modern computing, particularly in aerospace applications, where system components frequently fail in unpredictable ways. It emphasizes the need for continuous monitoring of system actions and the capability to handle faults transparently, ensuring that the functional properties of applications remain intact. This is crucial because restarting systems may not always be feasible during critical operations.
Failures are categorized into two types: hard failures, which result in software or hardware exceptions, and soft failures, where the program continues to run but produces incorrect results or compromises resources. The focus of the document is primarily on hard failures, as they pose significant risks to mission safety and data integrity.
The proposed technology aims to create a generic architecture that allows for automatic recovery from unexpected software failures, significantly increasing mission safety without the loss of critical flight data. This system is designed to integrate seamlessly with existing software, requiring minimal additional work from programmers, thus reducing development and debugging costs.
The document also highlights the growing complexity of computational tasks driven by scientists' demands for higher accuracy and detail. As computations become more complex, the likelihood of failures increases, particularly when tasks are distributed across multiple threads and processors. This situation necessitates robust solutions that ensure computing tools serve scientists effectively.
Applications of this technology extend beyond aerospace, with potential benefits for industries such as nuclear power, life support systems, and automotive engineering. The document underscores the importance of developing resilient systems that can withstand unexpected anomalies, thereby enhancing the overall safety and reliability of critical operations.
In summary, the document presents a comprehensive overview of innovative software solutions aimed at improving fault tolerance in flight systems, ultimately contributing to safer manned and unmanned missions in the aerospace sector and beyond.

