Home | Links | Contact Us | More About Intellectual Property | Bookmark
Search patents:
Home Processing Data Main-memory-system-and-checkpointing-protocol-for-a-fault-tolerant-computer-system-using-a-read-buffer

 Main memory system and checkpointing protocol for a fault-tolerant computer system using a read buffer

Details
Inventors: Stiffler, Jack J.;
Assignee: Texas Micro, Inc. (Houston, TX)
Primary Examiner: Decady; Albert
Assistant Examiner:
Attorney, Agent or Firm: Wolf, Greenfield & Sacks, P.C.

A mechanism for maintaining a consistent, periodically updated state in main memory without constraining normal computer operation is provided, thereby enabling a computer system to recover from faults without loss of data or processing continuity. In a typical computer system, a processor and input/output elements are connected to a main memory subsystem that includes a primary memory. In embodiments of the present invention, a read buffer is also appended to this main memory subsystem. During normal processing, a pre-image of data written to the primary memory may be captured by the read buffer. Data captured in the read buffer can restore the system to a previous checkpoint. This structure and protocol can guarantee a consistent state in main memory, thus enabling fault-tolerant operation.

DETAILED DESCRIPTION Embodiments of the present invention provide a main memory device and a process for maintaining, in a computer system, a consistent, periodically-updated, checkpoint state in the computer system's main memory while allowing the use of conventional cache coherency protocols and non-blocking caches.
The main memory device is accessed through one or more logical ports.
The main memory contains a primary memory element and a checkpoint memory element which are both coupled to the port.
The primary memory element is accessed in the same way as a standard main memory.
The checkpoint memory element captures certain data accesses from the main memory which are detectable because the checkpoint memory element is connected to the same port as the primary memory element.
The checkpoint memory element may be a physically separate memory module from the primary memory element, or the checkpoint memory element and the primary memory element may reside within the same memory module at different addresses.
These captured accesses are then used to ensure the existence of a consistent checkpoint state in the main memory.
A computer system using such a main memory device and having the appropriate detection and circumvention procedures can recover from faults without loss of data integrity or processing continuity.
In a typical computer system, a processor and input/output elements are connected to a main memory.
The connection is typically provided by one or more memory buses, cross-point switches or other mechanisms.
The main memory has one or more logical ports through which all accesses to the main memory are made.
In one embodiment of the present invention, a computer system includes a main memory and a buffer memory.
The buffer memory is generally a last-in, first-out memory, or stack, that captures a pre-image of all data that is subsequently to be written back to the main memory of the computer system.
Checkpoints are periodically established in the main memory.
Data stored in the buffer memory is discarded after each checkpoint



Related patents
  Interrupts between asynchronously operating CPUs in fault tolerant computer system
In accordance with one embodiment of the invention, a computer system employs three identical CPUs typically executing the same instruction stream, and has two identical,...
  Memory processing system and method for accessing memory including reordering memory requests to reduce mode switching
In accordance with the present invention, memory accesses are reordered to improve efficiency. A memory controller is used to arbitrate memory access requests from a ...
  Method and apparatus for snoop stretching using signals that convey snoop results
The present invention provides a protocol and related apparatus for snoop stretching in a computer system having at least one requesting agent for issuing bus ...

0.024

Archive: All patents - Links

Copyright (c)2006 Eipa-patents.org - All rights reserved