The use of micro-instruction cache

I learnt recently that Sandy Bridge uses a micro-instruction cache akin to the execution trace cache in Netburst. From what I know, many of the simpler x86 instructions (those translating into 4 micro-instructions or less) are decoded by hardwired decoders without any help from the microcode ROM, which is fairly a fast and simple process involving no table lookup. So, why would one cache the decoded micro-instructions and then look for them later, when they can be more easily translated by the decoder circuitry.

Is the cache only used for more complex instructions involving microcode ROM lookup? Or is there some higher wisdom involved.

Kraken

Posted 2016-08-17T21:37:56.363

Reputation: 355

Intel has generally limited instruction fetch to 16 byte blocks, the micro-op cache can provide up to 4 ops per cycle from a 32B chunk. For x86, storage of decoded instructions is more energy efficient than denser non-decoded instruction cache with dynamic decoding. The decoded cache can also be considered an L0 cache (lower latency with decent hit rate). There is also potential for decode-time optimizations. See http://www.realworldtech.com/haswell-cpu/2/

– Paul A. Clayton – 2016-08-18T01:05:18.630

So, its about the storage... – Kraken – 2016-08-18T07:18:30.453

The use of micro-instruction cache

No answers