Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Release standardized dataframe after its saving in the combined job #1968

Closed
benedeki opened this issue Nov 2, 2021 · 1 comment · Fixed by #1969
Closed

Release standardized dataframe after its saving in the combined job #1968

benedeki opened this issue Nov 2, 2021 · 1 comment · Fixed by #1969
Assignees
Labels
Conformance Conformance Job affected feature New feature priority: high Critical to the health of the project Standardization Standardization Job affected

Comments

@benedeki
Copy link
Collaborator

benedeki commented Nov 2, 2021

Background

When running Standardization and Conformance in one go, the standardized data remain cached even when already in conformance phase of processing.

Feature

Release standardized dataframe after its saving in the combined job

Proposed Solution [Optional]

User DataFrame.unpersist() method.

@benedeki benedeki added feature New feature Conformance Conformance Job affected Standardization Standardization Job affected priority: high Critical to the health of the project labels Nov 2, 2021
@benedeki benedeki self-assigned this Nov 2, 2021
benedeki added a commit that referenced this issue Nov 2, 2021
…d job

* added unpersist after Standardization in` StandardizationAndConformanceJob.scala`
benedeki added a commit that referenced this issue Nov 2, 2021
…d job (#1969)

* added unpersist after Standardization in` StandardizationAndConformanceJob.scala`
@benedeki
Copy link
Collaborator Author

benedeki commented Nov 3, 2021

Release notes:
Releasing cached standardization data after its phase in the combined Standardization&Conformance job, reducing its memory requirement.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Conformance Conformance Job affected feature New feature priority: high Critical to the health of the project Standardization Standardization Job affected
Projects
None yet
Development

Successfully merging a pull request may close this issue.

1 participant