[HUDI-5298] Optimize WriteStatus storing HoodieRecord#8472

Closed
xiaochen-zhou wants to merge 11 commits into apache:master from xiaochen-zhou:HUDI-5298

Conversation

@xiaochen-zhou (Contributor)

Change Logs

WriteStatus stores the entire HoodieRecord. We can optimize it to store just the required info (record key, partition path, location).

Impact

Optimize WriteStatus to store just the required info (record key, partition path, location).
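As a rough illustration of the change, a slimmed-down status entry might look like the following minimal sketch (class and field names here are illustrative, not the PR's actual API):

```java
import java.io.Serializable;

// Illustrative sketch: instead of retaining the full HoodieRecord (key,
// location, plus the potentially large payload), WriteStatus would keep
// only the fields needed downstream.
public class RecordInfoSketch implements Serializable {
  private final String recordKey;     // record key
  private final String partitionPath; // partition path
  private final String fileId;        // location: file group id
  private final String instantTime;   // location: commit instant

  public RecordInfoSketch(String recordKey, String partitionPath,
                          String fileId, String instantTime) {
    this.recordKey = recordKey;
    this.partitionPath = partitionPath;
    this.fileId = fileId;
    this.instantTime = instantTime;
  }

  public String getRecordKey() { return recordKey; }
  public String getPartitionPath() { return partitionPath; }
  public String getFileId() { return fileId; }
  public String getInstantTime() { return instantTime; }
}
```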

Risk level (write none, low medium or high below)

low

Documentation Update

none

Contributor's checklist

  • Read through contributor's guide
  • Change Logs and Impact were stated clearly
  • Adequate tests were added if applicable
  • CI passed


public class HoodieRecordStatus implements Serializable, KryoSerializable {


@danny0405 (Contributor) Apr 17, 2023

key + location are actually an index item, just rename it to IndexItem ?

@xiaochen-zhou (Contributor, Author)

> key + location are actually an index item, just rename it to IndexItem ?

Thank you very much for your review. I have modified the code; could you re-review it when you have time and leave some comments?

@danny0405 (Contributor)

It would be great if we could have numbers to illustrate the gains after the patch, like the cost reduction for memory or something.

@xiaochen-zhou (Contributor, Author)

> It would be great if we could have numbers to illustrate the gains after the patch, like the cost reduction for memory or something.

I would be happy to do it.

# Conflicts:
#	hudi-client/hudi-client-common/src/main/java/org/apache/hudi/index/inmemory/HoodieInMemoryHashIndex.java
@xiaochen-zhou (Contributor, Author) commented Apr 22, 2023

> It would be great if we could have numbers to illustrate the gains after the patch, like the cost reduction for memory or something.

I did a test based on your suggestion:
The number of HoodieRecords is 1000 * 100 * 2

WriteStatus status = new WriteStatus(false, 1.0);
Throwable t = new Exception("some error in writing");
for (int i = 0; i < 1000 * 100; i++) {
  status.markSuccess(mock(HoodieRecord.class), Option.empty());
  status.markFailure(mock(HoodieRecord.class), t, Option.empty());
}
System.out.println("status memory: " + ObjectSizeCalculator.getObjectSize(status));

The memory occupied by WriteStatus before the optimization is 125512336 bytes:

private final List<HoodieRecord> writtenRecords = new ArrayList<>();
private final List<HoodieRecord> failedRecords = new ArrayList<>();
status memory: 125512336

The memory occupied by WriteStatus after the optimization is 427408 bytes:

private final List<IndexItem> writtenRecordIndexes = new ArrayList<>();
private final List<IndexItem> failedRecordIndexes = new ArrayList<>();
status memory: 427408

@xiaochen-zhou (Contributor, Author) commented Apr 22, 2023

> It would be great if we could have numbers to illustrate the gains after the patch, like the cost reduction for memory or something.

The memory occupied by WriteStatus after the optimization is about 1/300 of that before the optimization!
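The claimed ratio follows directly from the two measurements above:

```java
// Checking the reported numbers: 125512336 / 427408 is roughly 294,
// i.e. the optimized WriteStatus uses about 1/300 of the memory.
public class RatioCheck {
  public static void main(String[] args) {
    long before = 125512336L; // bytes before optimization
    long after = 427408L;     // bytes after optimization
    System.out.println("reduction factor: " + (double) before / after);
  }
}
```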

@bvaradar (Contributor) left a comment

We would need to keep failed records as is for logging the complete record elsewhere cc @nsivabalan

@vinothchandar (Member)

@prashantwason @nbalajee @suryaprasanna would this break you all in any way? Do we need the record data anywhere for successful writes?

cc @rmahindra123 as well. same question.


import java.io.Serializable;

public class IndexItem implements Serializable, KryoSerializable {
Contributor:

Give some doc to the class.

/**
 * Identifies the record across the table.
 */
protected HoodieKey key;

Contributor:

Can we make all these members private and final?

@xiaochen-zhou (Contributor, Author)

> Can we make all these members private and final?

Thank you very much for the review, and sorry for the late response. I will try to modify the code according to your suggestions.

@xiaochen-zhou (Contributor, Author) May 6, 2023

> Can we make all these members private and final?

We may not be able to make all these members final because they need to be reassigned:

  @Override
  public final void read(Kryo kryo, Input input) {
    this.key = kryo.readObjectOrNull(input, HoodieKey.class);
    this.currentLocation = (HoodieRecordLocation) kryo.readClassAndObject(input);
    this.newLocation = (HoodieRecordLocation) kryo.readClassAndObject(input);
  }
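The constraint is not Kryo-specific: any deserialization hook that repopulates an already-constructed instance rules out final fields. A minimal stdlib analogy using java.io.Externalizable (class and field names here are illustrative):

```java
import java.io.Externalizable;
import java.io.IOException;
import java.io.ObjectInput;
import java.io.ObjectOutput;

// Illustrative analogy: like KryoSerializable#read above, readExternal
// must reassign fields on an already-constructed instance, so the
// fields cannot be final.
public class IndexItemLike implements Externalizable {
  private String key;      // cannot be final: reassigned in readExternal
  private String location; // cannot be final: reassigned in readExternal

  public IndexItemLike() {}  // public no-arg constructor required by Externalizable

  public IndexItemLike(String key, String location) {
    this.key = key;
    this.location = location;
  }

  @Override
  public void writeExternal(ObjectOutput out) throws IOException {
    out.writeObject(key);
    out.writeObject(location);
  }

  @Override
  public void readExternal(ObjectInput in) throws IOException, ClassNotFoundException {
    this.key = (String) in.readObject();      // reassignment forbids final
    this.location = (String) in.readObject();
  }

  public String getKey() { return key; }
  public String getLocation() { return location; }
}
```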

@prashantwason (Member)

> @prashantwason @nbalajee @suryaprasanna would this break you all in any way? Do we need the record data anywhere for successful writes?

The record index implementation requires the record key and the location to create the mapping in the index. This is a similar requirement to other non-implicit indexes like HBaseIndex.

@prashantwason (Member)

@clownxc If I understand correctly, the memory savings come from dropping the "data" part of the HoodieRecord? I noticed that HoodieRecord has only 2 additional members - sealed (boolean) and data (T). Are the savings due to the usage of the mock class (which may be bloated compared to the original HoodieRecord)?

But hoodie write handles deflate the HoodieRecord after writing, so the data portion should go away, reducing the amount of savings possible.

Can you run the test again with these changes:

  1. WriteStatus status = new WriteStatus(true, 1.0); // enable success record tracking as errors should be rare
  2. Create an actual HoodieRecord and use that in the for loop instead of the mock(HoodieRecord.class)
  3. Call deflate on the created HoodieRecord to remove the data, as the write handles do.

I feel the above may give a more realistic view of savings.

Also, how did you find this interesting optimization? I am interested as there may be other avenues for such savings within HUDI, so it would be good to know how you track these.

@xiaochen-zhou (Contributor, Author)

> @clownxc If I understand correctly, the memory savings come from dropping the "data" part of the HoodieRecord? I noticed that HoodieRecord has only 2 additional members - sealed (boolean) and data (T). Are the savings due to the usage of the mock class (which may be bloated compared to the original HoodieRecord)?
>
> But hoodie write handles deflate the HoodieRecord after writing, so the data portion should go away, reducing the amount of savings possible.
>
> Can you run the test again with these changes:
>
>   1. WriteStatus status = new WriteStatus(true, 1.0); // enable success record tracking as errors should be rare
>   2. Create an actual HoodieRecord and use that in the for loop instead of the mock(HoodieRecord.class)
>   3. Call deflate on the created HoodieRecord to remove the data, as the write handles do.
>
> I feel the above may give a more realistic view of savings.
>
> Also, how did you find this interesting optimization? I am interested as there may be other avenues for such savings within HUDI, so it would be good to know how you track these.

I would be happy to do it.

@xiaochen-zhou (Contributor, Author) commented May 6, 2023

> Also, how did you find this interesting optimization? I am interested as there may be other avenues for such savings within HUDI, so it would be good to know how you track these.

This interesting optimization was reported by @nsivabalan in HUDI-5298 and had not been implemented for a long time, so I am trying to complete it.

@hudi-bot (Collaborator) commented May 6, 2023

CI report:

Bot commands: @hudi-bot supports the following commands:
  • @hudi-bot run azure re-run the last Azure build

@xiaochen-zhou (Contributor, Author)

According to the suggestion provided by @prashantwason, I did a test as follows:

    WriteStatus status = new WriteStatus(true, 1.0);
    String partitionPath = HoodieTestDataGenerator.DEFAULT_PARTITION_PATHS[0];
    dataGen = new HoodieTestDataGenerator(new String[] {partitionPath});
    String newCommitTime = "001";
    List<HoodieRecord> records = dataGen.generateInserts(newCommitTime, 1000);
    Throwable t = new Exception("some error in writing");
    // Mark alternating records as success and failure,
    // deflating each record afterwards as the write handles do.
    for (int i = 0; i + 1 < 1000; i += 2) {
      HoodieRecord data1 = records.get(i);
      status.markSuccess(data1, Option.empty());
      data1.deflate();
      HoodieRecord data2 = records.get(i + 1);
      status.markFailure(data2, t, Option.empty());
      data2.deflate();
    }
    System.out.println("status memory: " + ObjectSizeCalculator.getObjectSize(status));

It was found that the memory occupied before the optimization (status memory: 113048) and after it (status memory: 117032) basically did not change. The main reasons are that the hoodie write handles deflate the HoodieRecord after writing, and that the mock class may be bloated compared to the real HoodieRecord (I'm sorry I didn't take these two factors into account in the previous test).
@prashantwason @danny0405 @vinothchandar

I do have a doubt: is some optimization needed for writeStatus.markFailure when an exception occurs before record.deflate()?

      writeStatus.markSuccess(hoodieRecord, recordMetadata);
      // deflate record payload after recording success. This will help users access payload as a
      // part of marking
      // record successful.
      hoodieRecord.deflate();
      return finalRecordOpt;
    } catch (Exception e) {
      LOG.error("Error writing record  " + hoodieRecord, e);
      writeStatus.markFailure(hoodieRecord, e, recordMetadata);
    }

Or, in some places, there is no deflate operation when writeStatus.markFailure is called:

    if (indexedRecord.isPresent()) {
      // Skip the ignored record.
      try {
        if (!indexedRecord.get().shouldIgnore(writeSchema, recordProperties)) {
          recordList.add(indexedRecord.get());
        }
      } catch (IOException e) {
        writeStatus.markFailure(record, e, record.getMetadata());
        LOG.error("Error writing record  " + indexedRecord.get(), e);
      }
    }

Although the benefit of optimizing this may not be large:

  public void markFailure(HoodieRecord record, Throwable t, Option<Map<String, String>> optionalRecordMetadata) {
    if (failedRecords.isEmpty() || (random.nextDouble() <= failureFraction)) {
      // Guaranteed to have at-least one error
      failedRecords.add(record);
      errors.put(record.getKey(), t);
    }
    totalRecords++;
    totalErrorRecords++;
  }
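For context, the sampling in markFailure above can be illustrated in isolation (a simplified, illustrative sketch, not Hudi's actual API):

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Random;

// Simplified sketch of the sampling in markFailure: keep roughly
// failureFraction of failed records, but always retain at least one.
public class FailureSampler {
  private final List<String> failedKeys = new ArrayList<>();
  private final Random random;
  private final double failureFraction;
  private long totalErrorRecords = 0;

  public FailureSampler(double failureFraction, long seed) {
    this.failureFraction = failureFraction;
    this.random = new Random(seed); // seeded for reproducibility
  }

  public void markFailure(String recordKey) {
    // Guaranteed to keep at least one error, then sample the rest.
    if (failedKeys.isEmpty() || random.nextDouble() <= failureFraction) {
      failedKeys.add(recordKey);
    }
    totalErrorRecords++;
  }

  public int sampledCount() { return failedKeys.size(); }
  public long totalErrorRecords() { return totalErrorRecords; }
}
```

With failureFraction = 0.01 and 10,000 failures, only roughly 1% of the failed keys are retained in memory, while the error counter still reflects the full total.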

I hope you can leave some comments when you have time. @prashantwason @danny0405 @vinothchandar

@bvaradar (Contributor) commented May 7, 2023

@clownxc: For failed records, we need to have them logged elsewhere, so there is no need to deflate. For exception cases, the write status should be marked as failure. So I don't see any reason to change this.

@xiaochen-zhou (Contributor, Author)

> @clownxc: For failed records, we need to have them logged elsewhere, so there is no need to deflate. For exception cases, the write status should be marked as failure. So I don't see any reason to change this.

I see. Thank you very much for the review.

@danny0405 (Contributor)

Cool, I think we are good to close this issue.
