Skip to content

Wide Record reader fix.#288

Merged
prince-cs merged 1 commit intodevelopfrom
wide-record-reader
Jun 30, 2025
Merged

Wide Record reader fix.#288
prince-cs merged 1 commit intodevelopfrom
wide-record-reader

Conversation

@prince-cs
Copy link
Contributor

@prince-cs prince-cs commented Jun 10, 2025

Currently records from a single batch are being read in a parallel stream processing and the result is being stored in its entirety. This could lead to out of memory(OOM).

The fix is to ensure that the batches are being read with the help of an iterator rather storing the entire result. Made the changes accordingly to SalesforceWideRecordReader.java

Validation has been done by making a query of length more than 20k and ensuring the pipeline passes.

JIRA: https://cdap.atlassian.net/browse/PLUGIN-1897

@MrRahulSharma
Copy link
Contributor

Please add more details in the description, like the issue, RCA, fix, how did we validate etc.

Copy link
Contributor

@itsankit-google itsankit-google left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

please add JIRA in the PR title.

Copy link
Contributor

@MrRahulSharma MrRahulSharma left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM

@prince-cs prince-cs force-pushed the wide-record-reader branch from ef1ba8d to 470cebb Compare June 30, 2025 11:54
@prince-cs prince-cs force-pushed the wide-record-reader branch from 470cebb to ed8948b Compare June 30, 2025 11:55
@prince-cs prince-cs merged commit 18ee8b2 into develop Jun 30, 2025
4 of 5 checks passed
@prince-cs prince-cs deleted the wide-record-reader branch June 30, 2025 13:04
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

Projects

None yet

Development

Successfully merging this pull request may close these issues.

4 participants