[RISCV][LLD] Add RISCV zcmt optimise in linker relaxation #77884

Xinlong-Wu · 2024-01-12T07:37:57Z

This patch is moved from https://reviews.llvm.org/D134600 .
Considering that the LLVM's Phabricator instance has been replaced by a static archive, it is not workable to review on Phabricator anymore. So I reopened this pr and synchronized the latest changes.
I can't reopen previous pr #68551 because the previous branch has been recreated

This patch implements optimizations for the zcmt extension in lld.

A new TableJumpSectio has been added.

Scans each R_RISCV_CALL/R_RISCV_CALL_PLT relocType in each section before the linker relaxation, recording the name of the symbol.

In finalizeContentsthe recorded symbol names are sorted in descending order by the number of jumps.

Optimise and insert a new cm.jt/cm.jalt during the relax process. in the process, we reused theR_RISCV_JAL relocType

co-author: @ScottEgerton

use `DenseMap` and `CachedHashStringRef`

Xinlong-Wu · 2024-01-12T07:38:13Z

I tried linking several common applications and libraries using this patch to compare Zcmt's contribution to reducing Code Szie.

The results can be found in the Sheet or in the image below.

Besides that, I found a problem that needs to be discussed.

In TableJumpSection::finalizeContents(), the linker will abort the tbljal optimization and empty the Jump table if tbljal optimization would cause a negative optimization (i.e., the size reduction caused by the Table jump Inst is less than the size increase caused by making the Jump Table).

However, the linker still adds the symbol __jvt_base$ to .symtab . This results in a small increase in program size.

Thus, I tried to delay adding the symbol __jvt_base$ at lld/ELF/Writer.cpp and remove the symbol __jvt_base$ when the Jump Table is not empty. Something like that

But it will crash with error placeholder symbol reached writer at lld/ELF/Symbols.cpp:154. Does anyone can point out anything else should I do to add symbol delay?

github-actions · 2024-01-12T07:42:21Z

✅ With the latest revision this PR passed the C/C++ code formatter.

MaskRay · 2024-01-12T08:00:17Z

Code size reduction | 0.14% | 0.08% | 0.15% | 0.22% | 0.19% | 0.38%

Let me express my gratitude for the dedicated work you put into measuring the impact. This has been useful to figure out how useful an extension is.

As an established code size reduction feature, global pointer relaxation seems to have a saving larger than this but still needed a lot of discussions whether it was justified (since the numbers aren't great either). The zcmt numbers seem smaller while the implementation is much heavier in assembler/linker and table jump seems to get questions on hardware side whether the little code size saving justifies probably significant performance overhead. I also heard that zcmt is incompatible with another extension. It would be greatly beneficial to see a more substantial commitment from hardware vendors, given the drawbacks, minimal saving, and the considerable implementation complexity.

asb · 2024-01-16T10:56:22Z

The RISC-V code size reduction work-group did some analysis of expected code size improvements from zcmt https://docs.google.com/spreadsheets/d/1bFMyGkuuulBXuIaMsjBINoCWoLwObr1l9h5TAWN8s7k/edit#gid=1679419155 - it would be worth understanding why the results seem so different.

kito-cheng

In TableJumpSection::finalizeContents(), the linker will abort the tbljal optimization and empty the Jump table if tbljal optimization would cause a negative optimization (i.e., the size reduction caused by the Table jump Inst is less than the size increase caused by making the Jump Table).

Add a testcase to demonstrate that?

lld/ELF/Options.td

lld/ELF/Arch/RISCV.cpp

…fect

Xinlong-Wu · 2024-01-25T13:22:48Z

I'm trying remove the symble .riscv.jvt when Jump table is empty, But I have noticed .shStrTab has been fixed befor relaxation. I can't remove it from string table. string .riscv.jvt in .shStrTab will always cause a negative effect

lld/ELF/Arch/RISCV.cpp

JackGittes · 2024-02-02T03:06:14Z

lld/ELF/Arch/RISCV.cpp

+// an increase in code size (i.e. the reduction from instruction conversion
+// does not cover the code size gain from adding a table entry).
+SmallVector<llvm::detail::DenseMapPair<const Symbol *, int>, 0>
+TableJumpSection::finalizeEntry(llvm::DenseMap<const Symbol *, int> EntryMap,


finalizeEntry seems not modifying any section members and the input EntryMap?

we actually do following 3 things

sort the EntryMap as decrease order by size reduction of each item in EntryMap

drop rest if EntryMap larger then maxSize

drop the item that have a negative effect

cmuellner · 2024-10-17T10:34:47Z

This PR is tracked here: riscv-admin/dev-partners#4

Unfortunately, there were no updates for more than half a year.
Are there suggestions to move this forward? Are there any blockers?

simonpcook

I tried integrating this into a LLVM build locally, and there are a couple of issues that need fixing which I've commented on the appropriate lines.

simonpcook · 2024-11-06T13:07:59Z

lld/ELF/Arch/RISCV.cpp

+  const auto jalr = sec.contentMaybeDecompress().data()[r.offset + 4];
+  const uint8_t rd = extractBits(jalr, 11, 7);


The loading value of this jalr register is indexing an array of uint8_ts so actually is only getting the least significant bit of rd, so any jalr using anything other than zero/ra will be converted. This isn't being picked up in tests because the encoding of ra is 1.

This assumes that the jalr instruction is 4 bytes after the relocation offset. This is true for R_RISCV_CALL, but R_RISCV_JAL relocations are also processed via this function. This means the instruction after the jal instruction is both processed for its bits in the rd field and deleted if the branch target is in the table.

The same issues appear in scanTableJumpEntries, but commented once for brevity.

math-gout · 2025-01-07T09:02:06Z

lld/ELF/Arch/RISCV.cpp

+
+TableJumpSection::TableJumpSection()
+    : SyntheticSection(SHF_ALLOC | SHF_EXECINSTR, SHT_PROGBITS,
+                       config->wordsize, ".riscv.jvt") {}


According to the RISC-V unprivileged specification (27.14.3. jvt CSR) the jvt section should be aligned to 64 bytes.
"If jvt is writable, the set of values the register may hold can vary by implementation. The value in the
BASE field must always be aligned on a 64-byte boundary."

RobinKastberg · 2025-10-01T09:38:42Z

This is relevant to my interests, is there anyone working on it? I am happy to help.

Xinlong-Wu added 30 commits November 28, 2023 10:53

Add tablejump support in lld linker relaxation

4846c45

reuse reloc type R_RISCV_JAL

0d7cccc

format

a57d853

fix TODO

50502da

fix name

a7545d8

format

9153af3

fix compile erroe

1b39df2

add testcase

59111f4

change the priority order of cm.jt/cm.jalt relax

d0c5fcf

address comments

99e08c8

fix part of comments

adbd7f4

fmt

726883a

tmp

9879205

update option start with --

b912fef

use `DenseMap` and `CachedHashStringRef`

reimplement Zcmt relax

192f501

fix testcase

74cf627

rebase & update

96d0f36

git format

3e79817

write table entry to .riscv.jvt section

451a817

format

e5f78da

address comments

a7d74e9

format

c116a18

move TableJumpSection to Arch/RISCV.cpp

51b7fd3

format

5aff069

stop relax to cm.jalt if it has negative

51be215

format

d317949

fix testcase

14a988b

store symbol instade of symbol name

48be3d1

format

f9b6db9

extend sizeof InputSection

1590a32

Xinlong-Wu requested review from MaskRay, jrtc27, Hsiangkai, kito-cheng and topperc January 12, 2024 07:37

Xinlong-Wu added 2 commits January 16, 2024 14:10

format the patch

2d5ca0f

format patch again

f0be958

kito-cheng reviewed Jan 16, 2024

View reviewed changes

lld/ELF/Options.td Outdated Show resolved Hide resolved

lld/ELF/Arch/RISCV.cpp Show resolved Hide resolved

lld/ELF/Arch/RISCV.cpp Outdated Show resolved Hide resolved

rename riscv-tbljal -> relax-tbljal, use int32_t to allow negative ef…

f69a0a0

…fect

format

8897e0f

sorear mentioned this pull request Jan 26, 2024

pcc metadata after cm.j(al)t riscv/riscv-cheri#58

Closed

ChunyuLiao mentioned this pull request Jan 30, 2024

Zce LLVM (for prototype) riscv-admin/dev-partners#4

Open

13 tasks

JackGittes reviewed Jan 31, 2024

View reviewed changes

lld/ELF/Arch/RISCV.cpp Show resolved Hide resolved

JackGittes reviewed Feb 2, 2024

View reviewed changes

Xinlong-Wu and others added 7 commits March 11, 2024 09:22

address comments

4e5ee18

fmt

c50d354

Merge branch 'main' into zce-zcmt-lld

578e726

Merge remote-tracking branch 'LLVM_Upstream/main' into zce-zcmt-lld

563d704

update

d1a83f1

Merge remote-tracking branch 'LLVM_Upstream/main' into zce-zcmt-lld

99b7c6a

Merge remote-tracking branch 'github/zce-zcmt-lld' into zce-zcmt-lld

b23e351

simonpcook requested changes Nov 6, 2024

View reviewed changes

math-gout reviewed Jan 7, 2025

View reviewed changes

		const auto jalr = sec.contentMaybeDecompress().data()[r.offset + 4];
		const uint8_t rd = extractBits(jalr, 11, 7);

[RISCV][LLD] Add RISCV zcmt optimise in linker relaxation #77884

Are you sure you want to change the base?

[RISCV][LLD] Add RISCV zcmt optimise in linker relaxation #77884

Uh oh!

Conversation

Xinlong-Wu commented Jan 12, 2024

Uh oh!

Xinlong-Wu commented Jan 12, 2024

Uh oh!

github-actions bot commented Jan 12, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

MaskRay commented Jan 12, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

asb commented Jan 16, 2024

Uh oh!

kito-cheng left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Xinlong-Wu commented Jan 25, 2024

Uh oh!

Uh oh!

Uh oh!

Uh oh!

JackGittes Feb 2, 2024

Choose a reason for hiding this comment

Uh oh!

Xinlong-Wu Mar 11, 2024

Choose a reason for hiding this comment

Uh oh!

cmuellner commented Oct 17, 2024

Uh oh!

simonpcook left a comment

Choose a reason for hiding this comment

Uh oh!

simonpcook Nov 6, 2024

Choose a reason for hiding this comment

Uh oh!

math-gout Jan 7, 2025

Choose a reason for hiding this comment

Uh oh!

RobinKastberg commented Oct 1, 2025

Uh oh!

Uh oh!

github-actions bot commented Jan 12, 2024 •

edited

Loading

MaskRay commented Jan 12, 2024 •

edited

Loading