@Borislav-Bonev-Ontotext
Contributor

Added functionality where all documents matching the input query, up to a configurable maximum, are loaded into memory and returned as a collection of named graphs instead of being processed one by one.
When the new functionality is used, the plugin no longer streams results; it needs a buffer to hold the documents until the query completes.

Added a system property that controls the maximum allowed batch size: graphdb.mongodb.maxBatchSize. The cap is there to prevent out-of-memory (OOM) errors.
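
To make the buffering behaviour concrete, here is a minimal sketch of how a batch cap read from the graphdb.mongodb.maxBatchSize system property might bound an in-memory buffer. It is not the plugin's actual code: the class name `BatchBufferSketch`, the helper `loadBatch`, the fallback value of 1000, and the use of a plain `Iterator` in place of a MongoDB cursor are all assumptions made for illustration. Only the property name comes from the description above.

```java
import java.util.ArrayList;
import java.util.Collections;
import java.util.Iterator;
import java.util.List;

public class BatchBufferSketch {
    // Property name as given in the PR description.
    static final String MAX_BATCH_SIZE_PROPERTY = "graphdb.mongodb.maxBatchSize";
    // Hypothetical fallback; the PR does not state the real default.
    static final int ASSUMED_DEFAULT_MAX_BATCH_SIZE = 1000;

    // Reads the cap from the system property, falling back to the assumed
    // default when the property is missing, unparsable, or not positive.
    static int resolveMaxBatchSize() {
        int configured = Integer.getInteger(MAX_BATCH_SIZE_PROPERTY, ASSUMED_DEFAULT_MAX_BATCH_SIZE);
        return configured > 0 ? configured : ASSUMED_DEFAULT_MAX_BATCH_SIZE;
    }

    // Buffers at most min(requestedSize, maxBatchSize) items from the cursor.
    // The Iterator stands in for a MongoDB result cursor (an assumption).
    static <T> List<T> loadBatch(Iterator<T> cursor, int requestedSize, int maxBatchSize) {
        int limit = Math.min(requestedSize, maxBatchSize);
        List<T> buffer = new ArrayList<>();
        while (buffer.size() < limit && cursor.hasNext()) {
            buffer.add(cursor.next());
        }
        return Collections.unmodifiableList(buffer);
    }

    public static void main(String[] args) {
        System.setProperty(MAX_BATCH_SIZE_PROPERTY, "3");
        List<String> docs = List.of("doc1", "doc2", "doc3", "doc4", "doc5");
        // The query asks for 10 documents, but the cap of 3 wins.
        List<String> batch = loadBatch(docs.iterator(), 10, resolveMaxBatchSize());
        System.out.println(batch); // prints [doc1, doc2, doc3]
    }
}
```

The point of the sketch is the trade-off named in the description: the whole batch sits in memory until it is returned, so the cap is what keeps a broad query from exhausting the heap. In a real JVM the property would typically be passed at startup, for example as -Dgraphdb.mongodb.maxBatchSize=3.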

@Borislav-Bonev-Ontotext changed the title from "Add mongo connector batch processing" to "GDB-11761: Add mongo connector batch processing" on Mar 26, 2025
- Fixed the failing tests and added a couple of new ones, which should
cover the specific client case.

- Added a CHANGELOG file.

- Added release and branch information to the README.
@tonyKunchev force-pushed the add-mongo-connector-batch-processing branch from 46f9b17 to 01df010 on July 3, 2025 11:56
@tonyKunchev self-assigned this on July 3, 2025
@tonyKunchev requested a review from ivorusev on July 3, 2025 12:20
@tonyKunchev merged commit 40e6f89 into master on July 3, 2025
@tonyKunchev deleted the add-mongo-connector-batch-processing branch on July 3, 2025 14:41