
Closes #3054: Dynamically switch to batching for larger csv writes #3061

Merged
merged 2 commits into Bears-R-Us:master from 3054_batch_csv_write on Apr 3, 2024

Conversation

stress-tess (Member) commented:

This PR (closes #3054) adds the ability to write chunks of data when writing CSV files that would otherwise cause us to run out of memory with the new optimization. We use the arkouda memory-management functions to determine approximately how much memory is available on each locale, then divide the data to be written on that locale into slices of that size. This keeps the Chapel-native strings array as big as possible without running out of memory, so hopefully we'll keep the performance bump for the small cases.
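
To illustrate the idea, here is a minimal single-locale sketch of the batching logic in Chapel. The names (`writeCsvBatched`, `availBytes`, `bytesPerRow`) are illustrative assumptions, not arkouda's actual API; the real implementation lives in src/CSVMsg.chpl and queries the arkouda memory-management functions per locale rather than using a config constant.

```chapel
use IO;

// Assumptions for this sketch: availBytes approximates the free memory on the
// locale, and bytesPerRow approximates the cost of one rendered CSV row.
config const availBytes = 2**30;
config const bytesPerRow = 64;

// Write `rows` to `filename`, building the in-memory string in bounded slices
// instead of concatenating every row at once.
proc writeCsvBatched(rows: [] string, filename: string) throws {
  const rowsPerBatch = max(1, availBytes / bytesPerRow);
  var f = open(filename, ioMode.cw);
  var w = f.writer();
  var lo = rows.domain.low;
  while lo <= rows.domain.high {
    const hi = min(lo + rowsPerBatch - 1, rows.domain.high);
    var chunk: string;                          // temporary stays roughly availBytes in size
    for i in lo..hi do chunk += rows[i] + "\n";
    w.write(chunk);
    lo = hi + 1;
  }
  w.close();
  f.close();
}

proc main() throws {
  const rows = ["a,1", "b,2", "c,3"];
  writeCsvBatched(rows, "out.csv");
}
```

Per the PR title, the batching only kicks in for larger writes, so small writes that already fit in memory keep the single-slice fast path and its performance bump.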

@jaketrookman (Contributor) left a comment:

Great improvements, looks good to me

@ajpotts (Contributor) left a comment:

Looks good other than the questions I marked.

Review threads on src/CSVMsg.chpl: 3 (all outdated and resolved)
@stress-tess requested a review from @ajpotts on April 2, 2024 21:26
@stress-tess merged commit 13d344a into Bears-R-Us:master on Apr 3, 2024
13 checks passed
@stress-tess deleted the 3054_batch_csv_write branch on April 3, 2024 15:15