Home | Links | Contact Us | More About Intellectual Property | Bookmark
Search patents:
Home Fault Detection Multi-threading-for-a-processor-utilizing-a-replay-queue

 Structured document management system, structured document management method, search device and search method
The present invention has been made in view of the above circumstances and provides a structured ...


 High speed read/modify/write memory system and method
One form of the present invention is an improvement for a data processing system. The improvement ...


 Methods and apparatus for caching a location index in a data storage system
Aspects of the present invention relate to improved techniques for accessing content in a storage ...


 Method and apparatus for correcting data errors
Broadly speaking, the present invention relates to an error correcting method and apparatus which ...


 Method and apparatus for enhancing data rate in processing ECC product-coded data arrays in DVD storage subsystems and the like
What is claimed is: 1. A machine-implementable method for enhancing the data transfer rate in an ...


 Data processing method and apparatus, recording medium, reproducing method and apparatus using the same method
OF THE INVENTION Hereinafter, referring to the accompanying drawings, embodiments of the present ...


 Method for testing integrated circuits having a grid-based, "cross-check" t e
The present invention is a new test structure which allows up to 100 percent electrical testing of V...


 Serial scan chain architecture for a data processing system and method of operation
What is claimed is: 1. A data processor having a scan chain architecture, the scan chain ...


 Hierarchically managed boundary-scan testable module and method
OF THE DRAWINGS FIG. 1 is a block diagram of electronic system 10 that accommodates JTAG testing ...


 Multiple BIST controllers for testing multiple embedded memory arrays
The present invention generally relates to various models representing at least a portion of a ...


 Multi-threading for a processor utilizing a replay queue

Details
Inventors: Merchant, Amit A.; Boggs, Darrell D.; Sager, David J.;
Assignee: Intel Corporation (Santa Clara, CA)
Primary Examiner: Donaghue; Larry D.
Assistant Examiner:
Attorney, Agent or Firm: Antonelli, Terry, Stout & Kraus, LLP

A processor is provided that includes an execution unit for executing instructions and a replay system for replaying instructions which have not executed properly. The replay system is coupled to the execution unit and includes a checker for determining whether each instruction has executed properly and a plurality of replay queues or replay queue sections coupled to the checker for temporarily storing one or more instructions for replay. In one embodiment, thread-specific replay queue sections may each be used to store long latency instruction for each thread until the long latency instruction is ready to be executed (e.g., data for a load instruction has been retrieved from external memory). By storing the long latency instruction and its dependents in a replay queue section for one thread which has stalled, execution resources are made available for improving the speed of execution of other threads which have not stalled.

DETAILED DESCRIPTION I.
Introduction According to an embodiment of the present invention, a processor is provided that speculatively schedules instructions for execution and includes a replay system.
Speculative scheduling allows the scheduling latency for instructions to be reduced.
The replay system replays instructions that were not correctly executed when they were originally dispatched to an execution unit.
For example, a memory load instruction may not execute properly if there is a L0 cache miss during execution, thereby requiring the instruction to be replayed (or re-executed).
However, one challenging aspect of such a replay system is the possibility for long latency instructions to circulate through the replay system and re-execute many times before executing properly.
One example of a long latency instruction could be a memory load instruction in which there is a L0 cache miss and a L1 cache miss (i.
e.
, on-chip cache miss) on the first execution attempt.
As a result, the execution unit may then retrieve the data from an external memory device across an external bus which can be very time consuming (e.
g.
, requiring several hundred clock cycles).
The unnecessary and repeated re-execution of this long latency load instruction before its source data has returned wastes valuable execution resources, prevents other instructions from executing and increases application latency.
Where there are multiple threads, one thread can stall due to a long latency instruction, thereby inhibiting execution of the other threads.
Therefore, according to an embodiment, a replay queue is provided for temporarily storing the long latency instruction and its dependent instructions.
When the long latency instruction is ready for execution (e.
g.
, when the source data for a memory load instruction returns from external memory), the long latency instruction and the dependent instructions can then be unloaded from the replay queue for execution.
According to an embodiment, the processor may include multiple replay queues, with at least one replay queue being provided per thread or program flow (for example)



Related patents
  Checkpointing of register file
The invention in one aspect includes methodology to perform an extra read from a register file prior to writing to that register file. The data from the extra read is ...
  Data synchronization of multiple remote storage
The present invention provides a method, and architecture for implementing that method, of synchronizing two or more remote or remote data storage facilities so that ...
  Method for snooping raid 1 read transactions by a storage device
A target device snooping method, according to one embodiment of the present invention, minimizes the utilization problems of a host system and an initiator associated ...
  Method and apparatus for content distribution via non-homogeneous access networks
The invention provides a method and apparatus that is capable of streaming content to different types of access networks in an interactive information distribution ...
  Code error detecting method
It is therefore an object of the present invention to provide an error detection method of detecting a code error in a data file employing an optical disk in which, ...
  Information recording medium and information record regenerating device therefor
An object of the present invention is to provide a recording medium and an error correcting method where the burst error correcting capability and the reliability of ...
  Optical disk apparatus having error correction circuit
OF THE INVENTION There will now be described an embodiment of this invention with reference to the accompanying drawings. FIG. 1 shows an optical disk apparatus of this ...
  Method and apparatus for distinguishing control channel from traffic channels
It is accordingly an object of the present invention to speed up the process of discriminating between traffic and control channels in a received signal. Another object ...
  Method for predicting a fill-up level of a buffer in an ATM network element
Hence, it is an object of the present invention to provide an improved method for predicting a fill-up level of a buffer in an ATM network element, and also to provide ...
  DSL modem utilizing low density parity check codes
It is therefore an object of the invention to provide simple methods of generating reproducible H matrices. It is another object of the invention to provide DSL modems ...

0.014

Archive: All patents - Links

Copyright (c)2006 Eipa-patents.org - All rights reserved