Skip to content

Memory Improvements in Export/Ingest#11417

Merged
ofahimIQSS merged 4 commits intoIQSS:developfrom
GlobalDataverseCommunityConsortium:Perf1
Apr 22, 2025
Merged

Memory Improvements in Export/Ingest#11417
ofahimIQSS merged 4 commits intoIQSS:developfrom
GlobalDataverseCommunityConsortium:Perf1

Conversation

@qqmyers
Copy link
Copy Markdown
Member

@qqmyers qqmyers commented Apr 9, 2025

What this PR does / why we need it: I scanned through the exporters and parts of ingest with AI and tried to address anything indicated as a potential memory/resource leak or where potentially large amounts of info were being loaded into memory.

Which issue(s) this PR closes:

  • Closes #

Special notes for your reviewer:

Suggestions on how to test this: Verify that export results, particularly the DDI export are the same for tabular files, both local files and S3 storage.

Does this PR introduce a user interface change? If mockups are available, please link/include them here:

Is there a release notes update needed for this change?:

Additional documentation:

@qqmyers qqmyers added the Size: 3 A percentage of a sprint. 2.1 hours. label Apr 9, 2025
@qqmyers qqmyers moved this to Ready for Triage in IQSS Dataverse Project Apr 9, 2025
@coveralls
Copy link
Copy Markdown

coveralls commented Apr 9, 2025

Coverage Status

coverage: 23.105% (+0.1%) from 23.007%
when pulling 248be67 on GlobalDataverseCommunityConsortium:Perf1
into cd48216 on IQSS:develop.

@ofahimIQSS ofahimIQSS moved this from Ready for Triage to Ready for Review ⏩ in IQSS Dataverse Project Apr 15, 2025
@landreev landreev self-requested a review April 15, 2025 15:30
@landreev landreev self-assigned this Apr 15, 2025
@scolapasta scolapasta moved this from Ready for Review ⏩ to In Review 🔎 in IQSS Dataverse Project Apr 16, 2025
@qqmyers qqmyers marked this pull request as ready for review April 17, 2025 22:38
@github-project-automation github-project-automation Bot moved this from In Review 🔎 to Ready for QA ⏩ in IQSS Dataverse Project Apr 21, 2025
@landreev
Copy link
Copy Markdown
Contributor

(out of scope of the pr, but this reminds me that we need to retire UNF. I don't think it costs us all that much, performance-wise, since we are subsetting the entire datafile into variable vectors to calculate sum. stats anyway. but it's not clear if it adds any value at this point either).

@ofahimIQSS ofahimIQSS self-assigned this Apr 21, 2025
@ofahimIQSS ofahimIQSS moved this from Ready for QA ⏩ to QA ✅ in IQSS Dataverse Project Apr 21, 2025
@ofahimIQSS
Copy link
Copy Markdown
Contributor

Hello, can I get an update with latest from dev for this.

@ofahimIQSS
Copy link
Copy Markdown
Contributor

looks good from my end, tested as described and performed further regression testing of other export metadata. merging.

@ofahimIQSS ofahimIQSS merged commit 14da5bd into IQSS:develop Apr 22, 2025
16 checks passed
@github-project-automation github-project-automation Bot moved this from QA ✅ to Merged 🚀 in IQSS Dataverse Project Apr 22, 2025
@ofahimIQSS ofahimIQSS removed their assignment Apr 22, 2025
@cmbz cmbz added the FY25 Sprint 21 FY25 Sprint 21 (2025-04-09 - 2025-04-23) label Apr 23, 2025
@scolapasta scolapasta moved this from Merged 🚀 to Done 🧹 in IQSS Dataverse Project Apr 23, 2025
@pdurbin pdurbin added this to the 6.7 milestone Apr 29, 2025
@qqmyers qqmyers mentioned this pull request May 15, 2025
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

FY25 Sprint 21 FY25 Sprint 21 (2025-04-09 - 2025-04-23) Size: 3 A percentage of a sprint. 2.1 hours.

Projects

Status: Done 🧹

Development

Successfully merging this pull request may close these issues.

7 participants