@AnoopKumarDelphix AnoopKumarDelphix commented Apr 8, 2025

DLPX-93376 Apache HOP project level performance configuration tuning

Problem

For Delimited, Fixed-Width, and Mainframe jobs, Apache HOP performs slightly worse than Kettle, or at best on par with it.

Between transform hops, a transform waits for a period of time that HOP calculates dynamically. This calculation is implemented in DynamicWaitTimes.java.
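To make the idea of a dynamically calculated wait concrete, here is a minimal sketch of one common approach: a wait that doubles while no rows arrive and resets once a row is read. It is an illustration under assumptions only. The class name BackoffWaitSketch, the doubling policy, and the min/max bounds are hypothetical and are not taken from DynamicWaitTimes.java.

```java
/**
 * Hypothetical illustration of a dynamically growing wait time.
 * This is NOT the code in HOP's DynamicWaitTimes.java; the doubling
 * policy and the bounds are assumptions made for this sketch.
 */
public class BackoffWaitSketch {
    private final long minMillis;
    private final long maxMillis;
    private long currentMillis;

    public BackoffWaitSketch(long minMillis, long maxMillis) {
        if (minMillis <= 0 || maxMillis < minMillis) {
            throw new IllegalArgumentException("need 0 < minMillis <= maxMillis");
        }
        this.minMillis = minMillis;
        this.maxMillis = maxMillis;
        this.currentMillis = minMillis;
    }

    /** Returns the wait to use now, then doubles the next wait up to the cap. */
    public long nextWaitMillis() {
        long wait = currentMillis;
        currentMillis = Math.min(maxMillis, currentMillis * 2);
        return wait;
    }

    /** Called after a row was read successfully: start again from the minimum. */
    public void reset() {
        currentMillis = minMillis;
    }
}
```

A policy like this keeps waits short while rows are flowing and backs off when a stream is idle, which is why the way the wait decision is made can affect throughput.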

Solution

Previously, the logic that decides whether the handleGetRow method waits for a rowset considered only one rowset. It now considers the sizes of all rowsets.
Note: Minimum sleep durations vary by OS scheduler, so an actual sleep could take 1 ms or more.
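The difference between the two checks can be sketched as follows. This is a simplified illustration, not the actual change in BaseTransform.java: the class RowSetWaitSketch and both method names are hypothetical, and plain java.util.Queue instances stand in for HOP rowsets.

```java
import java.util.ArrayDeque;
import java.util.List;
import java.util.Queue;

/**
 * Hypothetical sketch of the wait decision. Queues stand in for rowsets;
 * none of these names come from the HOP code base.
 */
public class RowSetWaitSketch {

    /** Old-style check (assumed): only the current rowset is inspected. */
    static boolean shouldWaitSingle(Queue<?> currentRowSet) {
        return currentRowSet.isEmpty();
    }

    /** New-style check (assumed): wait only if every input rowset is empty. */
    static boolean shouldWaitAll(List<? extends Queue<?>> rowSets) {
        for (Queue<?> rowSet : rowSets) {
            if (!rowSet.isEmpty()) {
                return false; // some stream has rows ready, so do not sleep
            }
        }
        return true;
    }

    public static void main(String[] args) {
        Queue<String> empty = new ArrayDeque<>();
        Queue<String> ready = new ArrayDeque<>();
        ready.add("row-1");

        System.out.println(shouldWaitSingle(empty));
        System.out.println(shouldWaitAll(List.of(empty, ready)));
    }
}
```

With one empty input and one input holding rows, the single-rowset check would decide to sleep while the all-rowsets check would not. Since each sleep costs at least one scheduler tick, avoiding unnecessary sleeps is where a gain would be expected.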

In addition, HOP 2.10 includes some performance optimizations, which have been pulled into this forked repo.
https://github.com/apache/hop/blob/release/2.10.0/engine/src/main/java/org/apache/hop/pipeline/transform/BaseTransform.java

Testing Done

VM : https://anp-at.dlpxdc.co/masking

masking-agnostic Partial

masking-agnostic Passed after rerun

  • Manually built the HOP binaries using ./mvnw clean install -Dmaven.test.skip=true
  • Replaced hop-engine.jar in the $MASKING_HOME/hop-runner/lib/ directory
  • Functionally tested all the file masking jobs; they work as expected.
  • Observed a slight increase in performance, at the cost of increased memory consumption.

Bonus

The HOP GUI classpath has also been corrected, in the same way as for hop.run.sh.

@AnoopKumarDelphix AnoopKumarDelphix force-pushed the dlpx/pr/AnoopKumarDelphix/88e1b791-1d06-4917-acfb-077a1c6a0b68 branch from a166c0d to e3d7597 Compare April 8, 2025 10:49
@AnoopKumarDelphix AnoopKumarDelphix changed the title from "DLPX-93560 HOP: Delimited file masking with record type is 20% slower on Apache Hop compared to Kettle" to "DLPX-93376 Apache HOP project level performance configuration tuning" Apr 14, 2025
@AnoopKumarDelphix AnoopKumarDelphix marked this pull request as ready for review April 22, 2025 08:28
@dlpx-akashpandey dlpx-akashpandey left a comment

LGTM

@sandeep-delphix sandeep-delphix left a comment

LGTM, just one small comment, please fix if needed.

@AnoopKumarDelphix AnoopKumarDelphix merged commit 1ccade5 into delphix/main Apr 23, 2025
2 checks passed
@AnoopKumarDelphix AnoopKumarDelphix deleted the dlpx/pr/AnoopKumarDelphix/88e1b791-1d06-4917-acfb-077a1c6a0b68 branch April 23, 2025 07:06