8318706: Implement JEP 423: Region Pinning for G1 #16342
Conversation
👋 Welcome back tschatzl! A progress list of the required criteria for merging this PR into `master` will be added to the body of your pull request.
`/label add hotspot-gc`
@tschatzl |
The JEP covers the idea very well, so I'm only covering some implementation details here:
* Regions get a "pin count" (a reference count). As long as it is non-zero, we conservatively never reclaim that region, even if there are no references in there: JNI code might have references to it.
* The JNI spec only requires us to provide pinning support for typeArrays, nothing else. This implementation uses this in various ways:
  * When evacuating from a pinned region, we evacuate everything live but the typeArrays, to get more empty regions to clean up later.
  * When formatting dead space within pinned regions we use filler objects. Pinned regions may be referenced by JNI code only, so we can't overwrite the contents of any dead typeArray either.
    Fortunately, these dead but referenced typeArrays have the same header size as our filler objects, so we can use their headers for our fillers. The problem is that previously there was a restriction that filler objects are at most half a region in size, so we could end up needing to place a filler object header inside a typeArray.
    The code could be clever and handle this situation by splitting the area to be filled so that this can't happen, but the solution taken here is to allow filler arrays to cover a whole region. They are not referenced by Java code anyway, so there is no harm in doing so (i.e. GC code never touches them anyway).
* G1 currently only ever actually evacuates young pinned regions. Old pinned regions of any kind are never put into the collection set and are automatically skipped. However, assuming that pinning is short-lived, we put them into the candidates when we can.
* There is the problem that if an application pins a region for a long time, G1 will skip evacuating that region over and over. That may lead to issues with the current policy on marking regions (only exit the mixed phase when there are no marking candidates) and simply waste processing time (when the candidate stays in the retained candidates).
  The cop-out chosen here is to "age out" the regions from the candidates and wait until the next marking happens.
  I.e. pinned marking candidates are immediately moved to the retained candidates, and if in total the region has been pinned for `G1NumCollectionsKeepUnreclaimable` collections it is dropped from the candidates. Its current value is fairly random.
* G1 pauses got a new tag if there were pinned regions in the collection set. I.e. in addition to something like
  `GC(6) Pause Young (Normal) (Evacuation Failure) 1M->1M(22M) 36.16ms`
  there is the new tag `(Pinned)` that indicates that one or more pinned regions were encountered during GC. E.g.
  `GC(6) Pause Young (Normal) (Pinned) (Evacuation Failure) 1M->1M(22M) 36.16ms`
  The `Pinned` and `Evacuation Failure` tags are not exclusive: GC might have encountered both pinned regions and evacuation-failed regions in the same collection, or even in the same region.
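The per-region "pin count" described in the first bullet can be sketched as a plain atomic reference count. This is a minimal illustration with assumed names (`Region`, `pin`, `unpin`), not HotSpot's actual classes:

```cpp
#include <atomic>
#include <cassert>
#include <cstddef>

// Sketch of a per-region pin count: a reference count bumped while JNI
// code holds a critical section on an object in the region.
class Region {
  std::atomic<size_t> _pin_count{0};

public:
  void pin() { _pin_count.fetch_add(1, std::memory_order_relaxed); }

  void unpin() {
    size_t old_count = _pin_count.fetch_sub(1, std::memory_order_relaxed);
    assert(old_count > 0 && "unbalanced unpin");
    (void)old_count;
  }

  // While non-zero, the collector conservatively never reclaims or moves
  // this region, even if it finds no live references into it.
  bool is_pinned() const {
    return _pin_count.load(std::memory_order_relaxed) > 0;
  }
};
```

Because it is a count rather than a flag, nested or overlapping JNI critical sections on objects in the same region compose naturally: the region stays pinned until the last one is released.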
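The filler-size change in the bullet on dead-space formatting can be illustrated numerically. The constants below are made up for the sketch, not the JVM's real region or filler limits:

```cpp
#include <cstddef>
#include <vector>

// Illustrative constant only; real values come from the JVM.
constexpr size_t kRegionWords = 1024;

// Split a gap of dead space into filler-object sizes given a maximum
// filler size. With the old cap of half a region, a region-sized gap
// needed two fillers, and the second filler's header could land inside
// a dead (but still JNI-referenced) typeArray. Raising the cap to a
// whole region lets a single filler cover the entire gap.
std::vector<size_t> filler_sizes(size_t gap_words, size_t max_filler_words) {
  std::vector<size_t> sizes;
  while (gap_words > 0) {
    size_t chunk = gap_words < max_filler_words ? gap_words : max_filler_words;
    sizes.push_back(chunk);
    gap_words -= chunk;
  }
  return sizes;
}
```

With `max_filler_words` at half a region, `filler_sizes(kRegionWords, kRegionWords / 2)` yields two fillers; with the new whole-region cap it yields one, so no filler header ever needs to be placed inside a dead typeArray.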
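The "age out" policy from the bullet on long-pinned regions can be sketched as a small state machine. Names and bookkeeping here are simplified and illustrative; the threshold mirrors the `G1NumCollectionsKeepUnreclaimable` flag mentioned above (later renamed `G1NumCollectionsKeepPinned`):

```cpp
// Sketch of "aging out" long-pinned regions from the candidate sets.
// Simplified, illustrative bookkeeping only.
constexpr unsigned kNumCollectionsKeepPinned = 8;

enum class CandidateState { Marking, Retained, Dropped };

struct Candidate {
  CandidateState state = CandidateState::Marking;
  unsigned pinned_collections = 0;  // total collections spent pinned

  // Called once per collection while this region is a candidate.
  void on_collection(bool currently_pinned) {
    if (state == CandidateState::Dropped || !currently_pinned) {
      // Dropped regions wait for the next marking; unpinned candidates
      // are evacuated normally.
      return;
    }
    // A pinned marking candidate is immediately moved to the retained set.
    if (state == CandidateState::Marking) {
      state = CandidateState::Retained;
    }
    // Pinned for too many collections in total: drop it from the
    // candidates until the next marking rebuilds them.
    if (++pinned_collections >= kNumCollectionsKeepPinned) {
      state = CandidateState::Dropped;
    }
  }
};
```

The point of the policy is to stop retrying hopeless candidates: a region pinned across many collections costs processing time every pause, so it is cheaper to forget it and let the next marking cycle rediscover it if it becomes reclaimable.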
whitespace fixes
…/HeapRegion.java so that resourcehogs/serviceability/sa/ClhsdbRegionDetailsScanOopsForG1.java does not fail
The new TestPinnedOldObjectsEvacuation.java test isn't stable yet; otherwise this passes tier1-8. No perf changes. I'm opening this PR for review even so: this is not a blocker for review, and I will fix it later.
Webrevs
Had a discussion with @albertnetymk and we came to the following agreement about naming: "allocation failure" means that allocation failed in the to-space due to memory exhaustion. I will apply this new naming asap.
… evacuation failure and types of it:
* evacuation failure is the general concept. It includes
  * pinned regions
  * allocation failure

One region can both be pinned and experience an allocation failure. G1 GC messages now use the tags "(Pinned)" and "(Allocation Failure)" instead of "(Evacuation Failure)". Did not rename the G1EvacFailureInjector since this adds a lot of noise.
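The tag taxonomy above can be illustrated with a tiny helper that composes the pause log line. `pause_message` and its parameters are made up for this sketch, not G1's actual logging code:

```cpp
#include <string>

// Compose a pause log line from the two independent evacuation-failure
// causes. Both tags may appear together, since one collection (or even
// one region) can see pinned regions and allocation failures.
std::string pause_message(bool saw_pinned, bool saw_alloc_failure) {
  std::string msg = "Pause Young (Normal)";
  if (saw_pinned)        msg += " (Pinned)";
  if (saw_alloc_failure) msg += " (Allocation Failure)";
  return msg;
}
```

For example, a pause that saw both kinds of regions would log something like `Pause Young (Normal) (Pinned) (Allocation Failure)`.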
Done. Tier1 seems to pass, will redo upper tiers again.
…oung/old generation.
```
          "retained region restore purposes.")                              \
          range(1, 256)                                                     \
                                                                            \
  product(uint, G1NumCollectionsKeepPinned, 8, DIAGNOSTIC,                  \
```
Any particular reason this is not EXPERIMENTAL?
Changing this does not in any way enable risky/experimental code not fit for production. This knob is for helping diagnose performance issues.
G1 does have its fair share of experimental options, but all/most of these were from the initial import where G1 as a whole had been experimental (unstable) for some time.
This flag is conceptually related (or similar) to G1RetainRegionLiveThresholdPercent, which is experimental, so I thought they should be in the same category.
@tschatzl This change now passes all automated pre-integration checks. ℹ️ This project also has non-automated pre-integration requirements. Please see the file CONTRIBUTING.md for details. After integration, the commit message for the final commit will be shown above. You can use pull request commands such as `/summary`, `/contributor` and `/issue` to adjust it as needed. At the time when this comment was updated there had been 57 new commits pushed to the `master` branch.
As there are no conflicts, your changes will automatically be rebased on top of these commits when integrating. If you prefer to avoid this automatic rebasing, please check the documentation for the `/integrate` command for further details. ➡️ To integrate this PR with the above commit message to the `master` branch, type `/integrate` in a new comment.
walulyai
left a comment
LGTM!
Nits:
@tschatzl this pull request can not be integrated into `master` due to one or more merge conflicts. To resolve these merge conflicts and update this pull request you can run the following commands:

```
git checkout submit/8318706-implementation-of-region-pinning-in-g1
git fetch https://git.openjdk.org/jdk.git master
git merge FETCH_HEAD
# resolve conflicts and follow the instructions given by git merge
git commit -m "Merge master"
git push
```
…Evacuation Failure" with a cause description (either "Allocation" or "Pinned")
kstefanj
left a comment
Looks good. Just a few small things.
- fix counting of pinned/allocation failed regions in log
- some cleanup of evacuation failure code, removing unnecessary members
- comments
Thanks @albertnetymk @kstefanj @walulyai for your reviews! Given that the JEP is now targeted, I will integrate. This has been a fairly long journey until today... :)

/integrate
Going to push as commit 38cfb22.
Your commit was automatically rebased without conflicts.
@tschatzl thanks for your excellent work! I know that JEP 423 targets JDK 22, but I wonder if this can be backported to JDK 21. For big data workloads like Apache Spark, we do see that G1 generally performs better than other GC algorithms, but one major issue is that Spark heavily uses JNI for compression/decompression (e.g. zstd-jni) and is thus prone to OOM. I have tested some internal Spark jobs that were easy to OOM on JDK 21 but work well on JDK 22; given that JDK 22 has reached EOL, I would appreciate it if this could be landed on JDK 21.
I am open to a better name for the `(Pinned)` tag.

Testing: tier1-8
Reviewing

Using git

Checkout this PR locally:
`$ git fetch https://git.openjdk.org/jdk.git pull/16342/head:pull/16342`
`$ git checkout pull/16342`

Update a local copy of the PR:
`$ git checkout pull/16342`
`$ git pull https://git.openjdk.org/jdk.git pull/16342/head`

Using Skara CLI tools

Checkout this PR locally:
`$ git pr checkout 16342`

View PR using the GUI difftool:
`$ git pr show -t 16342`

Using diff file

Download this PR as a diff file:
https://git.openjdk.org/jdk/pull/16342.diff
Webrev
Link to Webrev Comment