[RISC-V] correct comments in emitLoadImmediate #122382

credo-quia-absurdum · 2025-12-10T12:17:13Z

Summary

This PR makes three improvements to the comments in emitLoadImmediate:

Clarifies that a previous speculation in the comment is incorrect, while the heuristic still works in most cases.
Fixes a mismatch between the comment and the actual implementation (the formula ordering).
Updates the comment style to follow the repository's convention (// instead of /* */).

@clamp03 @tomeksowi @SkyShield, @namu-lee
part of #84834, cc @dotnet/samsung

Details

1. Speculation in the comment proven incorrect

The previous comment included the following speculative statement.

The smaller offset should yield the least instruction. (is this correct?)

I investigated this and found that the statement does not always hold.
In most cases the subtract form with a smaller offset produces fewer instructions, but there are counterexamples.

For example, for the immediate value:

0x739B'8000'FC80'05F4

The add form with the larger offset requires only 5 instructions,
while the subtract form with the smaller offset requires 8 instructions.

Addition Mode

Offset : 0xFC80'05F4

| # | Instruction | Immediate | Register Value After |
|---|---|---|---|
| 1 | `lui` | `0x7'39B8` | `0x0000'0000'739B'8000` |
| 2 | `slli` | `11` | `0x0000'039C'DC00'0000` |
| 3 | `addi` | `0x7E4` | `0x0000'039C'DC00'07E4` |
| 4 | `slli` | `21` | `0x739B'8000'FC80'0000` |
| 5 | `addi` | `0x5F4` | `0x739B'8000'FC80'05F4` |

Subtract Mode

Offset : 0x037F'FA0C

| # | Instruction | Immediate | Register Value After |
|---|---|---|---|
| 1 | `lui` | `0x7'39B8` | `0x0000'0000'739B'8000` |
| 2 | `addiw` | `0x001` | `0x0000'0000'739B'8001` |
| 3 | `slli` | `17` | `0x0000'E737'0002'0000` |
| 4 | `addi` | `0x901` | `0x0000'E737'0001'F901` |
| 5 | `slli` | `11` | `0x0739'B800'0FC8'0800` |
| 6 | `addi` | `0x860` | `0x0739'B800'0FC8'0060` |
| 7 | `slli` | `4` | `0x739B'8000'FC80'0600` |
| 8 | `addi` | `0xFF4` | `0x739B'8000'FC80'05F4` |
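The two tables can be checked by replaying them in a tiny register emulator. The following is a minimal sketch, not the emitter's code: the helper names (`signExtend12`, `replayAddForm`, `replaySubForm`, `viaAddOffset`, `viaSubOffset`) are made up for illustration, and the 12-bit immediates are sign-extended as the RISC-V I-type encoding specifies.

```cpp
#include <cassert>
#include <cstdint>

// Sign-extend a 12-bit I-type immediate, written as in the tables (e.g. 0x901).
static int64_t signExtend12(uint32_t imm12)
{
    return static_cast<int64_t>((imm12 & 0xFFFu) ^ 0x800u) - 0x800;
}

// lui: put the 20-bit immediate in bits 31:12, then sign-extend from bit 31.
static uint64_t lui(uint32_t imm20)
{
    return static_cast<uint64_t>(static_cast<int64_t>(static_cast<int32_t>(imm20 << 12)));
}

// addi: 64-bit add of the sign-extended immediate.
static uint64_t addi(uint64_t reg, uint32_t imm12)
{
    return reg + static_cast<uint64_t>(signExtend12(imm12));
}

// addiw: add, keep the low 32 bits, sign-extend them to 64 bits.
static uint64_t addiw(uint64_t reg, uint32_t imm12)
{
    uint32_t low = static_cast<uint32_t>(reg + static_cast<uint64_t>(signExtend12(imm12)));
    return static_cast<uint64_t>(static_cast<int64_t>(static_cast<int32_t>(low)));
}

// slli: logical left shift by a 6-bit shift amount.
static uint64_t slli(uint64_t reg, unsigned shamt)
{
    assert(shamt < 64);
    return reg << shamt;
}

// Replays the 5-instruction add-form table.
uint64_t replayAddForm()
{
    uint64_t r = lui(0x739B8);
    r = slli(r, 11);
    r = addi(r, 0x7E4);
    r = slli(r, 21);
    r = addi(r, 0x5F4);
    return r;
}

// Replays the 8-instruction subtract-form table.
uint64_t replaySubForm()
{
    uint64_t r = lui(0x739B8);
    r = addiw(r, 0x001);
    r = slli(r, 17);
    r = addi(r, 0x901);
    r = slli(r, 11);
    r = addi(r, 0x860);
    r = slli(r, 4);
    r = addi(r, 0xFF4);
    return r;
}

// Offset identities for this one example: upper word plus or minus the offset.
uint64_t viaAddOffset() { return (0x739B8000ull << 32) + 0xFC8005F4ull; }
uint64_t viaSubOffset() { return (0x739B8001ull << 32) - 0x037FFA0Cull; }
```

All four functions return `0x739B'8000'FC80'05F4`, matching the final row of each table and the two stated offsets.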

I experimented with an optimization that computes the exact instruction count for each mode and selects the cheaper one. However, the overall improvement was minimal.

Uniformly sampling the 64-bit space showed that only about one case per million improved compared to the existing heuristic, so I decided to drop this approach.

Although there are more cases where the larger add offset yields a better sequence, the practical impact remains small because the code generator falls back to emitDataConst whenever the required instruction count exceeds five.

2. Mismatch between comment and implementation

In the original comment, the third condition for enabling the left-shift form was written as:

(b - a) - (y - x) >= 11

However, the actual implementation evaluates the expression in the opposite order:

bool cond3 = ((y - x) - (b - a)) >= 11;

This PR corrects the comment to match the implementation and also formats the conditions in the comment to clarify the combined condition (1) && ( (2) || (3) ).
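To see why the ordering matters, here is a small sketch of the two forms of condition (3) and of the combined condition. The function names are hypothetical, and `a`, `b`, `x`, `y` are treated as plain integers here; their meaning is defined in the source comment, which is not reproduced in this thread.

```cpp
// Condition (3) as the old comment wrote it.
bool cond3AsOldComment(int a, int b, int x, int y)
{
    return ((b - a) - (y - x)) >= 11;
}

// Condition (3) as the implementation evaluates it.
bool cond3AsImplemented(int a, int b, int x, int y)
{
    return ((y - x) - (b - a)) >= 11;
}

// Combined condition as formatted in the corrected comment: (1) && ( (2) || (3) ).
bool combinedCondition(bool cond1, bool cond2, bool cond3)
{
    return cond1 && (cond2 || cond3);
}
```

The two left-hand sides are negatives of each other, so the two versions of condition (3) can never both be true for the same inputs; the old comment described a different condition from the one the code checks.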

Copilot

Pull request overview

This PR corrects and improves comments in the emitLoadImmediate function for RISC-V code generation. The changes clarify a previously speculative comment that has been proven incorrect through investigation, fix a mismatch between comment and implementation for formula ordering, and modernize comment style to follow repository conventions.

Key changes:

Clarifies that smaller offsets do not always yield fewer instructions, providing a concrete counterexample
Corrects the formula in condition (3) from (b - a) - (y - x) >= 11 to (y - x) - (b - a) >= 11 to match the actual implementation
Converts all block comments from /* */ style to // style per repository conventions


tomeksowi

LGTM, cc @fuad1502


dotnet-policy-service · 2025-12-10T21:42:26Z

Tagging subscribers to this area: @JulieLeeMSFT, @jakobbotsch
See info in area-owners.md if you want to be subscribed.

fuad1502 · 2025-12-11T03:36:06Z

Hi @credo-quia-absurdum, thank you for looking into this 🙏 I always wondered how the heuristic compared with an optimal solution. GCC, for example, if I recall correctly, uses backtracking to find the optimal solution. If you have the analysis publicly available, I would love to see it 🙏

If you want to look into optimizing this further as mentioned by @tomeksowi, you can refer to a draft PR I've made before here. I forgot why I didn't follow through with it, but maybe there's something there that can help 🙏

fuad1502

LGTM, thank you 🙏

credo-quia-absurdum · 2025-12-11T09:40:23Z

@fuad1502 Here is the analysis I performed.

I reproduced the emitter logic in a small C++ program with a simple register emulator and sampled uniformly across the 64-bit space. In the text files linked below, Add Only refers to the instruction count when the emitter always uses the add-offset form, while Sub Only refers to the count when it always uses the subtract-offset form. Each entry also includes the instruction count produced by the current heuristic (without the fallback to emitDataConst), along with the corresponding 64-bit immediate value in binary form.
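For readers who want to try a similar experiment, uniform sampling of 64-bit immediates could look like the sketch below. This is an assumption about the setup, not the program used for the analysis: the function name `sampleImmediates`, the seed parameter, and the choice of `std::mt19937_64` are all illustrative.

```cpp
#include <cstddef>
#include <cstdint>
#include <random>
#include <vector>

// Draws `count` immediates uniformly from the full 64-bit range.
// std::mt19937_64 already yields values spread over [0, 2^64 - 1],
// so its raw output is used directly.
std::vector<uint64_t> sampleImmediates(std::size_t count, uint64_t seed)
{
    std::mt19937_64 rng(seed);
    std::vector<uint64_t> samples;
    samples.reserve(count);
    for (std::size_t i = 0; i < count; ++i)
    {
        samples.push_back(static_cast<uint64_t>(rng()));
    }
    return samples;
}
```

A fixed seed makes a run reproducible, which helps when comparing the add-only, subtract-only, and heuristic instruction counts over the same set of immediates.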

Validating the "Smaller Offset Is Better" Assumption

First, I examined whether the assumption "the smaller offset always yields fewer instructions" holds. The results, available here, show that the assumption fails for a noticeable share of the samples: 9,617 out of 1 million samples (~1%) were counterexamples where the larger offset produced a shorter instruction sequence.

Potential Improvement Over the Current Heuristic with Fallback

Next, I evaluated how much improvement would be possible if the heuristic were perfect, under the constraint that we fall back to emitDataConst whenever the instruction count exceeds five. The results are here.

For this analysis, I only tracked cases where:

the offset selected by the current heuristic yields more instructions than the alternative, and
the alternative form generates five or fewer instructions (i.e., would not trigger the fallback).

Out of 1,000,000,000 sampled immediates, only 1,389 cases (~1 per 0.7M) showed any opportunity for improvement over the current implementation. This extremely small gain is primarily due to the fallback rule: once the instruction count exceeds five, the emitter switches to emitDataConst, nullifying potential benefits.
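The two tracking criteria listed above can be written as a single predicate. This is a sketch with hypothetical names (`kMaxInlineInstructions`, `isImprovable`); only the threshold of five and the two criteria come from the description in this comment.

```cpp
// Above this many instructions the emitter falls back to emitDataConst.
constexpr int kMaxInlineInstructions = 5;

// True when the form the heuristic did not pick is both shorter and
// short enough to avoid the emitDataConst fallback.
bool isImprovable(int heuristicCount, int alternativeCount)
{
    return heuristicCount > alternativeCount &&
           alternativeCount <= kMaxInlineInstructions;
}
```

For the counterexample in the PR description, the heuristic's subtract form takes 8 instructions and the add form takes 5, so `isImprovable(8, 5)` is true. A case such as `isImprovable(8, 6)` is false, because both forms would end in the fallback anyway.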

clamp03 · 2025-12-12T02:13:04Z

@credo-quia-absurdum Please check the jit-format error and remove the trailing whitespace.

credo-quia-absurdum · 2025-12-12T02:39:35Z

@clamp03 Thanks for pointing it out. I've updated the PR and removed the trailing whitespace.

[RISC-V] correct comments in emitLoadImmediate

9cae25b

Copilot AI review requested due to automatic review settings December 10, 2025 12:17

github-actions bot added the needs-area-label An area label is needed to ensure this gets routed to the appropriate area owners label Dec 10, 2025

dotnet-policy-service bot added the community-contribution Indicates that the PR has been added by a community member label Dec 10, 2025

Copilot started reviewing on behalf of credo-quia-absurdum December 10, 2025 12:18

Copilot AI reviewed Dec 10, 2025


[RISC-V] fix typo in comments

e50cded

clamp03 assigned credo-quia-absurdum Dec 10, 2025

clamp03 added the arch-riscv Related to the RISC-V architecture label Dec 10, 2025

build-analysis bot mentioned this pull request Dec 10, 2025

iOS device not found in OSX.13.Amd64.Iphone.Open dotnet/dnceng#6440


tomeksowi approved these changes Dec 10, 2025


tomeksowi reviewed Dec 10, 2025


tomeksowi reviewed Dec 10, 2025


am11 added area-CodeGen-coreclr CLR JIT compiler in src/coreclr/src/jit and related components such as SuperPMI and removed needs-area-label An area label is needed to ensure this gets routed to the appropriate area owners labels Dec 10, 2025

[RISC-V] update function header comment style

57e08b7

fuad1502 approved these changes Dec 11, 2025


clamp03 requested review from jakobbotsch and jkotas December 11, 2025 05:44

credo-quia-absurdum and others added 2 commits December 12, 2025 11:34

[RISC-V] remove trailing white space

db7eb78

Merge branch 'main' into riscv64-emitloadimm-comment-correction

e7ecfa3

clamp03 approved these changes Dec 12, 2025
