Generate `flink-conf.yaml` file automatically to set optimum conf values #874

chandrashekar-s · 2023-11-08T09:39:29Z

Description of what I changed

Fixes #823

Enabled a feature flag to automate the generation of flink-conf.yaml based on the numThreads in application.yaml file and the number of cores available in the machine.
If the flag is disabled, then the flink-conf.yaml directed by the env. var FLINK_CONF_DIR will be used. If var is empty, then default conf will be used.
JAVA_OPTS can be set to override the default JVM heap and other parameter values.
The application will fail to launch if the JVM Off Heap memory is insufficient for the Flink cluster to launch.
Fixes the regression introduced by this change where the maxWorkers was defaulted to 1 and in turn the num of shards for parquet files was also 1.

E2E test

Tested Full Run and Incremental Run by enabling and disabling the flags. Also tested for different numThread configurations.

TESTED:

Load tested the application on a 48 core machine. Given below are the results

Run #	Run Type	# of Patient Records	Time in secs (Before Change)	Time in secs (After Change)	Remarks
1	Full	9K	400	358	Full run timing remains the same before and after the changes
2	Full (Repeat)	9K	360	256
3	Incremental	9K Upfront + 100 additional	140	30	The improvement in time after change is because of fixing the regression (increased num of shards from 1 to N)
4	Incremental	9K Upfront + 500 additional	180	32

After fixing the regression the time taken for incremental run has significantly reduced

Checklist: I completed these to help reviewers :)

I have read and will follow the review process.
I am familiar with Google Style Guides for the language I have coded in.

No? Please take some time and review Java and Python style guides.
My IDE is configured to follow the Google code styles.

No? Unsure? -> configure your IDE.
I have added tests to cover my changes. (If you refactored existing code that was well tested you do not have to add tests)
I ran mvn clean package right before creating this pull request and added all formatting changes to my commit.
All new and existing tests passed.
My pull request is based on the latest changes of the master branch.

No? Unsure? -> execute command git pull --rebase upstream master

chandrashekar-s · 2023-11-10T02:56:49Z

@bashir2 The build is failing currently, I am looking into it.

chandrashekar-s · 2023-11-20T14:52:26Z

@bashir2 The PR is ready for review. Since changes were made to create target jars compatible with jdk 17, the docker images had to be updated for streaming module.

bashir2 · 2023-11-20T17:23:32Z

@bashir2 The PR is ready for review. Since changes were made to create target jars compatible with jdk 17, the docker images had to be updated for streaming module.

Thanks @chandrashekar-s for the updates. I'll review this by tomorrow.

bashir2

Thanks @chandrashekar-s for this change.

pipelines/batch/src/main/java/com/google/fhir/analytics/ParquetMerger.java

pipelines/controller/pom.xml

pipelines/controller/src/main/java/com/google/fhir/analytics/FlinkConfiguration.java

pipelines/controller/src/main/java/com/google/fhir/analytics/PipelineManager.java

pipelines/controller/src/main/java/com/google/fhir/analytics/FlinkConfiguration.java

pipelines/controller/src/test/java/com/google/fhir/analytics/FlinkConfigurationTest.java

chandrashekar-s · 2023-11-22T13:33:32Z

@bashir2 Thanks for reviewing the changes. I have addressed/responded to the comments. Can you please have a look. Also, for few cases where unit test cases could not be written, I feel we need few more tests for e2e or performance to avoid regressions.

bashir2

Thanks @chandrashekar-s the remaining comments are all questions or minor suggestions. Please feel free to merge after addressing them.

pipelines/batch/src/main/java/com/google/fhir/analytics/ParquetMerger.java

pipelines/controller/src/main/java/com/google/fhir/analytics/FlinkConfiguration.java

pipelines/controller/config/application.yaml

pipelines/controller/src/main/java/com/google/fhir/analytics/FlinkConfiguration.java

pipelines/controller/src/main/java/com/google/fhir/analytics/PipelineManager.java

pipelines/controller/src/main/java/com/google/fhir/analytics/FlinkConfiguration.java

chandrashekar-s · 2023-11-28T15:37:47Z

Thanks @bashir2 for the reviewing the changes. I have addressed some of the comments in the latest commit. Also I have created these 2 issues #893 and #891 for Validating Flink in non-local mode and Investigating/fixing the reshuffle operations for writing to parquet files respectively.

chandrashekar-s · 2023-11-29T08:28:12Z

The performance results have been attached in the PR description. No noticeable changes for the Full run, but for Incremental run the timing has improved after fixing the number of Shards to be N (Flink parallelism)

chandrashekar-s force-pushed the automate-flink-params branch from b31e84a to 5dfb2c7 Compare November 8, 2023 09:55

chandrashekar-s requested a review from bashir2 November 8, 2023 09:56

chandrashekar-s mentioned this pull request Nov 8, 2023

Default memory configurations fail in a low resource environment #823

Closed

chandrashekar-s force-pushed the automate-flink-params branch 3 times, most recently from 0f173f4 to ab8d8f5 Compare November 20, 2023 12:40

bashir2 reviewed Nov 21, 2023

View reviewed changes

chandrashekar-s added 4 commits November 22, 2023 10:09

Generate flink-conf.yaml file automatically to set optimum conf values

e93c8f6

upgraded streaming dockerfile jdk version to 17

71db005

upgraded e2e tests dockerfile jdk version to 17

13a5973

upgraded e2e tests dockerfile python version

28d214e

chandrashekar-s force-pushed the automate-flink-params branch from ab8d8f5 to 28d214e Compare November 22, 2023 04:40

review comments

21a9ab6

bashir2 approved these changes Nov 24, 2023

View reviewed changes

bashir2 reviewed Nov 24, 2023

View reviewed changes

pipelines/controller/src/main/java/com/google/fhir/analytics/FlinkConfiguration.java Show resolved Hide resolved

Merge branch 'master' into automate-flink-params

f464de4

chandrashekar-s force-pushed the automate-flink-params branch from c692ec0 to 4a11dde Compare November 28, 2023 14:53

review comments

9cad1ef

chandrashekar-s force-pushed the automate-flink-params branch from 4a11dde to 9cad1ef Compare November 28, 2023 14:58

chandrashekar-s merged commit 7dd6df2 into google:master Nov 29, 2023
5 checks passed

bashir2 mentioned this pull request Nov 29, 2023

Attempting to fix the e2e failure #892

Closed

7 tasks

chandrashekar-s mentioned this pull request Feb 15, 2024

Enable the pipelines for Flink non-local execution modes as well #893

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Generate `flink-conf.yaml` file automatically to set optimum conf values #874

Generate `flink-conf.yaml` file automatically to set optimum conf values #874

chandrashekar-s commented Nov 8, 2023 •

edited

Loading

chandrashekar-s commented Nov 10, 2023

chandrashekar-s commented Nov 20, 2023

bashir2 commented Nov 20, 2023

bashir2 left a comment

chandrashekar-s commented Nov 22, 2023

bashir2 left a comment

chandrashekar-s commented Nov 28, 2023 •

edited

Loading

chandrashekar-s commented Nov 29, 2023

Generate flink-conf.yaml file automatically to set optimum conf values #874

Generate flink-conf.yaml file automatically to set optimum conf values #874

Conversation

chandrashekar-s commented Nov 8, 2023 • edited Loading

Description of what I changed

E2E test

Checklist: I completed these to help reviewers :)

chandrashekar-s commented Nov 10, 2023

chandrashekar-s commented Nov 20, 2023

bashir2 commented Nov 20, 2023

bashir2 left a comment

Choose a reason for hiding this comment

chandrashekar-s commented Nov 22, 2023

bashir2 left a comment

Choose a reason for hiding this comment

chandrashekar-s commented Nov 28, 2023 • edited Loading

chandrashekar-s commented Nov 29, 2023

Generate `flink-conf.yaml` file automatically to set optimum conf values #874

Generate `flink-conf.yaml` file automatically to set optimum conf values #874

chandrashekar-s commented Nov 8, 2023 •

edited

Loading

chandrashekar-s commented Nov 28, 2023 •

edited

Loading