Risc-v compressed instruction alignment [closed]

Question

Closed. This question is opinion-based. It is not currently accepting answers.

Want to improve this question? Because this question may lead to opinionated discussion, debate, and answers, it has been closed. You may edit the question if you feel you can improve it so that it requires answers that include facts and citations or a detailed explanation of the proposed solution. If edited, the question will be reviewed and might be reopened.

Closed last month.

Improve this question

When supported, compressed instructions (RVC) bring a relaxation of code address alignment from 4 to 2 bytes. They also bring the possibility to intermix compressed and non-compressed instructions.

Does it make any sense, in terms of transfer performance from RAM to i-cache (instruction cache), to layout the code to keep an even number of subsequent RVC instructions?

Or is it totally irrelevant? Any reference?

fuz · Accepted Answer · 2025-09-16 15:36:47Z

3

Branch targets should be aligned, but otherwise there is no difference generally.

From the FU740-C000 manual (that's the SoC in the SiFive Unmatched)

3.2.5 Instruction Fetch Unit

The S7 instruction fetch unit is responsible for keeping the pipeline fed with instructions from memory. The instruction fetch unit delivers up to 8 bytes of instructions per clock cycle to support superscalar instruction execution. Fetches are always word-aligned and there is a one-cycle penalty for branching to a 32-bit instruction that is not word-aligned.

The S7 implements the standard Compressed (C) extension to the RISC‑V architecture, which allows for 16-bit RISC‑V instructions. As four 16-bit instructions can be fetched per cycle, the instruction fetch unit can be idle when executing programs comprised mostly of compressed 16-bit instructions. This reduces memory accesses and power consumption.

(...)

3.2.6 Branch Prediction

(...)

The BHT is a correlating predictor that supports long branch histories. The BTB has one-cycle latency, so that correctly predicted branches and direct jumps result in no penalty, provided the target is 8-byte aligned.

It is thus advisable that branch targets be aligned to 8 bytes or to at least align 32-bit instructions that are branch targets to 4 bytes. No penalty is mentioned for unaligned 32-bit instructions in an instruction stream, so there most likely is none.

edited Sep 16 at 15:36

answered Sep 16 at 14:44

fuz

94.7k27 gold badges216 silver badges391 bronze badges

Sign up to request clarification or add additional context in comments.

3 Comments

fuz Sep 16 at 15:36

@EnzoR I have reverted your edit as it is incorrect. The number of bytes fetched from RAM to instruction cache is usually one cache line, which is in the ballpark of 64 or 128 bytes. It makes no sense to align to that size usually.

EnzoR Sep 16 at 15:51

"Branch target should be aligned" to 16 bit for RVC and 32 for RV?

fuz Sep 16 at 16:03

I explain this in detail in the last paragraph. Read again.

Collectives™ on Stack Overflow

Risc-v compressed instruction alignment [closed]

1 Answer 1

3.2.5 Instruction Fetch Unit

3.2.6 Branch Prediction

3 Comments

Hot Network Questions

Collectives™ on Stack Overflow

1 Answer 1

3.2.5 Instruction Fetch Unit

3.2.6 Branch Prediction

3 Comments

Related