Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Improvement for exportAll batch #6302

Closed
Thanh-Thanh opened this issue Oct 22, 2019 · 2 comments
Closed

Improvement for exportAll batch #6302

Thanh-Thanh opened this issue Oct 22, 2019 · 2 comments

Comments

@Thanh-Thanh
Copy link
Contributor

Hi all,

This exportAll batch is supposed to be executed at 2 a.m each day.
It normally does a loop over all the datasets (having harvestedFrom is null).

In our case, this concerns all our datasets and so, it takes too much time to finish the job (~3 hours), without doing any exports.

We wonder if you think of an improuvement for this by limiting on the "entry" datasets of this batch?

Another one suggest : an atomic API for exporting an dataset ?

Thanks in advanced,

Thanh Thanh

@poikilotherm
Copy link
Contributor

Related to #5345

@djbrooke
Copy link
Contributor

@sekmiller @landreev any thoughts here? If this is the process, I'm wondering if there's some way to shorten this by only looking at those records that have been updated in the last 24 hours or something?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Development

No branches or pull requests

3 participants