DLPX-93376 Apache HOP project level performance configuration tuning #6
Performance with Apache HOP is slightly lower than or similar to Kettle for Delimited, Fixed-Width and Mainframe jobs.
Problem
Between transform hops, the engine waits for a period of time that HOP calculates dynamically. This is implemented in DynamicWaitTimes.java.
Solution
Previously, the logic that decides whether the handleGetRow method waits for a rowset considered only one rowset. It now considers the sizes of all rowsets.
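The change from a single-rowset check to an all-rowsets check can be illustrated with a minimal sketch. This is not the BaseTransform code: the class name `RowSetWaitSketch`, the `SizedRowSet` interface and both method names are hypothetical, and "wait only when every rowset is empty" is just one plausible reading of "consider all the rowset size".

```java
import java.util.List;

/** Hypothetical sketch; not the BaseTransform code from HOP. */
final class RowSetWaitSketch {

    /** Minimal stand-in for a rowset: only its current size matters here. */
    interface SizedRowSet {
        int size();
    }

    /** Old behavior as described: the decision looks at a single rowset only. */
    static boolean shouldWaitSingle(List<? extends SizedRowSet> rowSets, int current) {
        return rowSets.get(current).size() == 0;
    }

    /** New behavior (assumed reading): wait only if every rowset is empty. */
    static boolean shouldWaitAll(List<? extends SizedRowSet> rowSets) {
        for (SizedRowSet rowSet : rowSets) {
            if (rowSet.size() > 0) {
                return false; // at least one rowset has rows, so no wait is needed
            }
        }
        return true;
    }
}
```

With one empty and one non-empty rowset, the single-rowset check can still choose to wait when it happens to look at the empty one, while the all-rowsets check does not.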
Note: Minimum sleep durations vary by OS scheduler; the shortest sleep could be 1 ms or more.
Also, HOP 2.10 includes some performance optimizations, which have been pulled into this forked repo.
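To see the scheduler granularity on a given machine, the observed duration of a short sleep can be measured directly. The sketch below is only an illustration: the class and method names are hypothetical, and it uses just `Thread.sleep` and `System.nanoTime` from the standard library.

```java
/** Hypothetical helper for measuring how long a short sleep actually takes. */
final class SleepGranularitySketch {

    /**
     * Returns the average observed duration, in nanoseconds, of
     * Thread.sleep(requestedMillis) over the given number of iterations.
     */
    static long averageSleepNanos(long requestedMillis, int iterations) {
        if (iterations <= 0) {
            throw new IllegalArgumentException("iterations must be positive");
        }
        long totalNanos = 0;
        int completed = 0;
        for (int i = 0; i < iterations; i++) {
            long start = System.nanoTime();
            try {
                Thread.sleep(requestedMillis);
            } catch (InterruptedException e) {
                Thread.currentThread().interrupt(); // restore the flag and stop measuring
                break;
            }
            totalNanos += System.nanoTime() - start;
            completed++;
        }
        return completed == 0 ? 0 : totalNanos / completed;
    }
}
```

Calling `SleepGranularitySketch.averageSleepNanos(1, 100)` and dividing the result by 1,000,000 gives the average duration in milliseconds of a requested 1 ms sleep on the machine under test.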
https://github.com/apache/hop/blob/release/2.10.0/engine/src/main/java/org/apache/hop/pipeline/transform/BaseTransform.java
Testing Done
VM : https://anp-at.dlpxdc.co/masking
masking-agnostic Partial
masking-agnostic Passed after rerun
./mvnw clean install -Dmaven.test.skip=true
Bonus
The HOP GUI class path has been corrected, similar to hop.run.sh.