This repository has been archived by the owner on Jan 13, 2022. It is now read-only.

Automatically clean up after failed indexing runs #402

Closed
aldenstpage opened this issue Jan 13, 2020 · 12 comments
Labels
✨ goal: improvement Improvement to an existing feature Hacktoberfest Ideal for Hacktoberfest participation help wanted Open to participation from the community 🙅 status: discontinued Not suitable for work as repo is in maintenance 🏷 status: label work required Needs proper labelling before it can be worked on

Comments

@aldenstpage
Contributor

When an indexing job fails (such as if a node in our Elasticsearch cluster has a full disk, or a bug in indexer-worker halts the process), the incomplete index is left inside of the Elasticsearch cluster, requiring someone to manually delete it. The indexer should detect this condition when the job starts and handle it.

The production index is determined by the image alias. The indexer should delete any index that follows the naming scheme image-<uuid> and is NOT pointed to by this alias.
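To make the proposed rule concrete, here is a minimal sketch of how the cleanup could look. It is not the indexer's actual code: the function names `find_stale_indices` and `clean_up_stale_indices` are hypothetical, `es` is assumed to be an elasticsearch-py style client exposing `indices.get_alias`, `indices.get`, and `indices.delete`, and the check that the suffix parses as a UUID is an assumption about what image-<uuid> means in practice.

```python
import uuid


def find_stale_indices(all_indices, live_indices, prefix="image-"):
    """Return names matching <prefix><uuid> that the alias does not point to."""
    stale = []
    for name in all_indices:
        if name in live_indices or not name.startswith(prefix):
            continue
        try:
            # Assumption: the suffix is anything uuid.UUID can parse.
            uuid.UUID(name[len(prefix):])
        except ValueError:
            continue  # Not part of the image-<uuid> scheme, so leave it alone.
        stale.append(name)
    return sorted(stale)


def clean_up_stale_indices(es, alias="image"):
    """Delete leftover indices at job start (hypothetical helper).

    Note: a real client raises an error if the alias does not exist;
    the caller is assumed to handle that case.
    """
    live = set(es.indices.get_alias(name=alias))  # keys are index names
    if not live:
        # Without a live index we cannot tell what is safe to delete.
        return []
    candidates = es.indices.get(index=f"{alias}-*")
    stale = find_stale_indices(candidates, live, prefix=f"{alias}-")
    for name in stale:
        es.indices.delete(index=name)
    return stale
```

Keeping the selection logic in a pure function makes it testable without a cluster, and refusing to delete anything when the alias resolves to no index guards against wiping the only usable data.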

@aldenstpage aldenstpage added the help wanted Open to participation from the community label Jan 13, 2020
@aldenstpage aldenstpage self-assigned this Jan 13, 2020
@kgodey kgodey added this to To Be Prioritized in Backlog via automation Jan 13, 2020
@hedonhermdev

Can I work on this issue?

@CodeMonk263

Can I work on this issue?

@kgodey
Contributor

kgodey commented Feb 24, 2020

@hedonhermdev go ahead. @CodeMonk263 please find another issue to work on since @hedonhermdev commented first.

@annatuma annatuma added this to Ready for Development in Active Sprint via automation Feb 28, 2020
@annatuma annatuma removed this from To Be Prioritized in Backlog Feb 28, 2020
@annatuma annatuma moved this from Ready for Development to In Progress (Community) in Active Sprint Feb 28, 2020
@DantrazTrev
Contributor

@hedonhermdev are you still working on this issue?

@hedonhermdev

hedonhermdev commented Feb 29, 2020 via email

@DantrazTrev
Contributor

DantrazTrev commented Feb 29, 2020

Can I take it over?
@aldenstpage

@kgodey
Contributor

kgodey commented Mar 3, 2020

Go ahead @DantrazTrev

@aldenstpage aldenstpage removed this from In Progress (Community) in Active Sprint Apr 30, 2020
@kgodey kgodey added ✨ goal: improvement Improvement to an existing feature and removed enhancement labels Sep 24, 2020
@dhruvkb dhruvkb added the Hacktoberfest Ideal for Hacktoberfest participation label Sep 25, 2020
@tushar912
Contributor

@DantrazTrev are you still working on this?

@kgodey
Contributor

kgodey commented Oct 2, 2020

@tushar912 it's been a few months since @DantrazTrev's post, so I think you can go ahead and work on this.

@tushar912
Contributor

Ok

@tushar912
Contributor

The way I understand this issue is as follows. The main indexing job is done by indexer.py in ingestion_server. The TableIndexer class contains a method, _index_table, which checks whether the database is in sync with the index and replicates if not. There are two methods of indexing: reindex, which creates a new index and points the live alias at it, and update, which updates the existing index. Currently, if the index is not created successfully during reindex, it still persists in the cluster, so the job is to delete the index if indexing fails. @kgodey or @aldenstpage, please tell me if I have understood correctly.
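One way to picture the "delete the index if indexing fails" reading is the sketch below. It is an illustration only: `reindex_with_cleanup` and the `build_index` callable are hypothetical stand-ins, not names from indexer.py, and `es` is assumed to be an elasticsearch-py style client exposing `indices.exists` and `indices.delete`.

```python
def reindex_with_cleanup(es, build_index, new_index):
    """Run a reindex and remove the half-built index if it fails.

    build_index is a hypothetical callable that creates and populates
    new_index; the real reindex logic would take its place.
    """
    try:
        build_index(new_index)
    except Exception:
        # Best-effort cleanup so the incomplete index does not linger.
        if es.indices.exists(index=new_index):
            es.indices.delete(index=new_index)
        raise  # Re-raise so the failure is still reported to the caller.
```

This only helps when the process survives long enough to run the handler. If the worker is killed or the cluster is unreachable, the index stays behind, which is why the original request asks for detection when the next job starts.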

@tushar912
Contributor

Also, I am thinking of modifying the existing consistency_check method and adding it to reindex so that the index is deleted if it was not indexed properly. Am I on the right track?

@cc-open-source-bot cc-open-source-bot added the 🏷 status: label work required Needs proper labelling before it can be worked on label Dec 2, 2020
@kgodey kgodey added this to [TEMPORARY] Deprioritize in Active Sprint Dec 2, 2020
@kgodey kgodey removed this from [TEMPORARY] Deprioritize in Active Sprint Dec 2, 2020
@kgodey kgodey added this to Pending Review in Backlog Dec 2, 2020
@kgodey kgodey added this to [TEMPORARY] Deprioritize in Active Sprint Dec 2, 2020
@kgodey kgodey removed this from [TEMPORARY] Deprioritize in Active Sprint Dec 2, 2020
@kgodey kgodey added this to [TEMPORARY] Deprioritize in Active Sprint Dec 2, 2020
@kgodey kgodey removed this from [TEMPORARY] Deprioritize in Active Sprint Dec 2, 2020
@kgodey kgodey added the 🙅 status: discontinued Not suitable for work as repo is in maintenance label Dec 16, 2020
@kgodey kgodey closed this as completed Dec 16, 2020
@kgodey kgodey moved this from Pending Review to Done in Backlog Dec 16, 2020

8 participants