Replace TransitiveClosure with an ObjectQueue trait #607

wks · 2022-06-06T12:19:52Z

As we discussed in #559, TransitiveClosure should be split into two traits: one for scanning objects, and the other for enqueuing objects. This PR does the latter: it replaces TransitiveClosure with an object-enqueuing trait, ObjectQueue, and removes TransitiveClosure completely.

With this change, the trace_object method now takes an &mut impl ObjectQueue parameter. For ProcessEdgesWork, it should pass &mut self.base().nodes instead of self to trace_object.
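For readers who have not seen the diff, the shape of the change can be sketched roughly as follows. This is a minimal, self-contained illustration and not the mmtk-core code: `ObjectReference` is reduced to a wrapper around `usize`, and `ToySpace` and `ToyProcessEdges` are hypothetical stand-ins. Only the names `ObjectQueue`, `enqueue`, `trace_object` and `nodes` come from the PR.

```rust
use std::collections::HashSet;

// Illustrative sketch only: simplified stand-ins, not the mmtk-core definitions.

/// Simplified stand-in for an object reference.
#[derive(Clone, Copy, Debug, PartialEq, Eq, Hash)]
pub struct ObjectReference(pub usize);

/// A trait that only knows how to enqueue objects for later scanning.
pub trait ObjectQueue {
    fn enqueue(&mut self, object: ObjectReference);
}

/// A plain vector of pending objects can serve as the queue.
impl ObjectQueue for Vec<ObjectReference> {
    #[inline(always)]
    fn enqueue(&mut self, object: ObjectReference) {
        self.push(object);
    }
}

/// Hypothetical non-moving space that records marked objects in a set.
pub struct ToySpace {
    marked: HashSet<ObjectReference>,
}

impl ToySpace {
    pub fn new() -> Self {
        ToySpace { marked: HashSet::new() }
    }

    /// Marks `object` on its first visit and enqueues it for scanning.
    /// Returns the (unmoved) reference.
    pub fn trace_object<Q: ObjectQueue>(
        &mut self,
        queue: &mut Q,
        object: ObjectReference,
    ) -> ObjectReference {
        if self.marked.insert(object) {
            queue.enqueue(object);
        }
        object
    }
}

/// Hypothetical work packet that owns the node buffer.
pub struct ToyProcessEdges {
    pub nodes: Vec<ObjectReference>,
}
```

In this sketch the packet passes `&mut packet.nodes` rather than itself, which mirrors passing `&mut self.base().nodes` to `trace_object`.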

There is an alternative solution which uses the return value to indicate which object to enqueue. I think this PR is the more direct approach, because "enqueuing" is part of the semantics of trace_object, and it is better to expose a queue to trace_object so it can enqueue objects itself, instead of letting the caller enqueue them. Consequently, unlike the alternative solution, this PR cannot address the issue of "only updating edges when the object is actually moved" (#574). That will be addressed in a separate PR.
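For contrast, the return-value alternative might look roughly like the sketch below. The exact design of that alternative is not spelled out in this thread, so `TraceResult`, `needs_enqueue` and `process_edge` are hypothetical names chosen for illustration, and `ObjectReference` is again a simplified stand-in.

```rust
use std::collections::HashSet;

// Illustrative sketch only: one possible shape of the return-value alternative.

/// Simplified stand-in for an object reference.
#[derive(Clone, Copy, Debug, PartialEq, Eq, Hash)]
pub struct ObjectReference(pub usize);

/// Hypothetical result type: the new reference, plus whether the caller
/// is expected to enqueue it.
pub struct TraceResult {
    pub new_object: ObjectReference,
    pub needs_enqueue: bool,
}

/// Hypothetical non-moving space that records marked objects in a set.
pub struct ToySpace {
    marked: HashSet<ObjectReference>,
}

impl ToySpace {
    pub fn new() -> Self {
        ToySpace { marked: HashSet::new() }
    }

    /// No queue parameter: the space only reports what happened.
    pub fn trace_object(&mut self, object: ObjectReference) -> TraceResult {
        let first_visit = self.marked.insert(object);
        TraceResult { new_object: object, needs_enqueue: first_visit }
    }
}

/// The caller is now responsible for enqueuing.
pub fn process_edge(
    space: &mut ToySpace,
    nodes: &mut Vec<ObjectReference>,
    object: ObjectReference,
) -> ObjectReference {
    let result = space.trace_object(object);
    if result.needs_enqueue {
        nodes.push(result.new_object);
    }
    result.new_object
}
```

Because the caller sees the result before acting on it, a design like this leaves room for the caller to decide whether to update the edge, which is the point raised in #574.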

Closes: #559

Now ProcessEdgesBase::nodes implements ObjectQueue instead of ProcessEdgesWork itself. Renamed parameters: trace -> queue, T -> Q.

Updated some comments. Made `enqueue` always inlined. Removed the unused `tracelocal.rs`.

wks · 2022-06-08T15:01:00Z

I think the code is ready, but I am waiting for the benchmark result.

wks · 2022-06-12T17:58:44Z

I ran Dacapo Chopin on bobcat.moma (Alder Lake), comparing the master and this branch. 20 iterations, 3 invocations each, 4.8x min heap size.

Statistics show that the effect on STW time is negligible. (Outliers (zscore >= 3) are removed.)

Geomean of relative STW time across different benchmarks:

SemiSpace: 1.0038804314826324
Immix: 0.998644089075811
GenCopy: 1.0000574290637219
GenImmix: 0.9942469216074474
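The aggregation behind these numbers can be sketched as follows. This is an assumed reconstruction, not the script that was actually used: it assumes the z-score is taken as an absolute value against the population standard deviation, and that each benchmark contributes one ratio (branch mean over master mean) to the geometric mean. The function names are hypothetical.

```rust
// Illustrative sketch only: an assumed reconstruction of the statistics.

/// Drops samples whose absolute z-score is at or above `threshold`.
/// Uses the population standard deviation (an assumption).
pub fn remove_outliers(samples: &[f64], threshold: f64) -> Vec<f64> {
    if samples.len() < 2 {
        return samples.to_vec();
    }
    let n = samples.len() as f64;
    let mean = samples.iter().sum::<f64>() / n;
    let variance = samples.iter().map(|x| (x - mean).powi(2)).sum::<f64>() / n;
    let sd = variance.sqrt();
    if sd == 0.0 {
        // All samples are identical, so there is nothing to remove.
        return samples.to_vec();
    }
    samples
        .iter()
        .copied()
        .filter(|x| ((x - mean) / sd).abs() < threshold)
        .collect()
}

/// Arithmetic mean of a non-empty slice.
pub fn mean(samples: &[f64]) -> f64 {
    samples.iter().sum::<f64>() / samples.len() as f64
}

/// Geometric mean of a non-empty slice of positive ratios,
/// computed in log space.
pub fn geomean(ratios: &[f64]) -> f64 {
    let log_sum: f64 = ratios.iter().map(|r| r.ln()).sum();
    (log_sum / ratios.len() as f64).exp()
}
```

A value close to 1.0, as in all four plans above, means that the branch and master spend about the same time in STW pauses on average.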

FYI: Here are the scattered points for all executions (with outliers removed), together with box plots and violin plots, in case you don't trust the means and standard deviations. Note that the plots for some plan-benchmark combinations are missing. For some combinations (such as SemiSpace-batik), GC did not happen during the execution; for others (such as luindex and pmd with generational collectors), it crashed. (Running lusearch and pmd with generational collectors results in crashes on both master and the branch. We should investigate later.)

k-sareen · 2022-06-12T18:49:18Z

For some combinations, (such as luindex and pmd with generational collectors) it crashed. (Running lusearch and pmd with generational collectors will result in crashes for both master and the branch. We should investigate later.)

Yes, I found this earlier when I was doing a performance evaluation of MMTk and brought it up in a meeting. I recall @wenyuzhao mentioning that it might be related to the barriers. I believe he will be updating the barrier code due to his LXR PR, so it might be resolved with it.

EDIT: I believe I made an issue for it here: mmtk/mmtk-openjdk#156

Those lines of code were added by mistake.

wenyuzhao

LGTM. Just two minor questions regarding #[inline].

One thing that may help is configuring codegen-units in the bindings. Setting it to a low value (like 1) may improve codegen quality, so that functions in mmtk-core are more likely to be inlined.
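As a sketch of that suggestion, a binding could set the option in the release profile of its own Cargo.toml. Whether any given binding already does this, and whether 1 is the right value for it, is not established in this thread.

```toml
# Cargo.toml of the binding crate (illustrative fragment)
[profile.release]
# Fewer codegen units give the compiler a wider view of the crate graph,
# at the cost of longer build times.
codegen-units = 1
```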

wenyuzhao · 2022-06-13T22:41:15Z

src/plan/generational/global.rs

@@ -178,42 +178,42 @@ impl<VM: VMBinding> Gen<VM> {
    }

    /// Trace objects for spaces in generational and common plans for a full heap GC.
-    pub fn trace_object_full_heap<T: TransitiveClosure>(
+    pub fn trace_object_full_heap<Q: ObjectQueue>(


Shall we mark this as #[inline(always)]? The same goes for the trace_object_nursery function below.

I tried. However, if I mark this as #[inline(always)], the Rust compiler decides not to inline another call inside this function. When I then mark that function as #[inline(always)], the Rust compiler decides not to inline yet another function, and this goes on and on. So I decided to fix the inlining problem later, not in this PR, because the performance of generational GC doesn't seem to be affected by this PR.

wenyuzhao · 2022-06-13T22:42:26Z

src/plan/global.rs

@@ -601,33 +601,33 @@ impl<VM: VMBinding> BasePlan<VM> {
        pages
    }

-    pub fn trace_object<T: TransitiveClosure>(
+    pub fn trace_object<Q: ObjectQueue>(


Not sure if it is helpful to mark this as #[cold] or #[inline(never)]. This may reduce the inlined code size of the hot trace_object calls.

Yes. It may reduce the inline code size. But I am not sure if we should mark it as #[cold] or #[inline(never)].

The VM space (if it exists) may not be cold.

If in_space is cheap and simple, inlining it should help skip objects that are not in those spaces.
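The trade-off being discussed can be illustrated with a small sketch: keep a cheap `in_space` check inlined on the hot path, and move the rarely taken fallback into a function marked `#[cold]` and `#[inline(never)]`. `RangeSpace`, `ToyPlan` and `trace_object_rare` are hypothetical names, and the sketch takes no position on whether BasePlan::trace_object should actually be treated this way.

```rust
// Illustrative sketch only: simplified stand-ins, not the mmtk-core definitions.

/// Simplified stand-in for an object reference.
#[derive(Clone, Copy, Debug, PartialEq, Eq)]
pub struct ObjectReference(pub usize);

pub trait ObjectQueue {
    fn enqueue(&mut self, object: ObjectReference);
}

impl ObjectQueue for Vec<ObjectReference> {
    #[inline(always)]
    fn enqueue(&mut self, object: ObjectReference) {
        self.push(object);
    }
}

/// Hypothetical space covering the address range `start..end`.
pub struct RangeSpace {
    pub start: usize,
    pub end: usize,
}

impl RangeSpace {
    /// Cheap range check, so inlining it into callers is inexpensive.
    #[inline(always)]
    pub fn in_space(&self, object: ObjectReference) -> bool {
        (self.start..self.end).contains(&object.0)
    }
}

/// Hypothetical plan with one frequently hit space and one rarely hit space.
pub struct ToyPlan {
    pub hot: RangeSpace,
    pub rare: RangeSpace,
}

impl ToyPlan {
    /// Hot path: only the cheap check and the common case are inlined.
    #[inline(always)]
    pub fn trace_object<Q: ObjectQueue>(
        &self,
        queue: &mut Q,
        object: ObjectReference,
    ) -> ObjectReference {
        if self.hot.in_space(object) {
            queue.enqueue(object);
            return object;
        }
        self.trace_object_rare(queue, object)
    }

    /// Fallback kept out of line so it does not bloat every call site.
    #[cold]
    #[inline(never)]
    fn trace_object_rare<Q: ObjectQueue>(
        &self,
        queue: &mut Q,
        object: ObjectReference,
    ) -> ObjectReference {
        if self.rare.in_space(object) {
            queue.enqueue(object);
        }
        object
    }
}
```

If the fallback space turns out to be hit often (as a VM space might be), the `#[cold]` hint would be misleading, which is the concern raised in the reply above.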

wks added 9 commits June 5, 2022 11:06

Rename TransitiveClosure to ObjectQueue

97c31ab

Implement ObjectQueue and mass renaming.

b50f8da

Now ProcessEdgesBase::nodes implements ObjectQueue instead of ProcessEdgesWork itself. Renamed parameters: trace -> queue, T -> Q.

Remove SFTProcessEdgesMutRef

6f0f4b2

Minor fixes and formating.

d93e944

Always inline CopySpace::trace_object

21fb066

Merge branch 'master' into object-queue-trait

d5a9cfb

trace_object for other spaces and sanity checker

5b770c8

Minor changes

b315156

Updated some comments. Made `enqueue` always inlined. Removed the unused `tracelocal.rs`.

Fixed tutorial code.

03e6e08

wks marked this pull request as ready for review June 8, 2022 15:00

Remove unnecessary method implementations.

89a20ab

Those lines of code were added by mistake.

wks requested review from qinsoon and wenyuzhao June 13, 2022 02:39

wenyuzhao approved these changes Jun 13, 2022

wks merged commit 6410f0a into mmtk:master Jun 14, 2022

k-sareen mentioned this pull request Jun 23, 2022

Work packet cleanup #172

Closed

3 tasks
