Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

dpul:reindex:collections does not fully index collections #1294

Closed
hackartisan opened this issue Feb 3, 2022 · 4 comments · Fixed by #1346
Closed

dpul:reindex:collections does not fully index collections #1294

hackartisan opened this issue Feb 3, 2022 · 4 comments · Fixed by #1346

Comments

@hackartisan
Copy link
Member

hackartisan commented Feb 3, 2022

When the reindex is run in bulk via this rake task collections do not get fully indexed. For example, when run recently on dpul-staging the shakespeareandco collection had 548 items after all the indexing was done. On prod it has 787.

This bug has been observed now in both prod and staging.

eastasian also indexed with many fewer items: 247 out of 441. However, sae_sri_lanka_dissidents and piranesi indexed completely.

When run individually via the console, shakespeareandco indexed all records.

@kelea99
Copy link
Contributor

kelea99 commented Apr 14, 2022

@hackartisan Steve F. Just reported the Lapidus collection went down to only 336 items from over a thousand. It is the same issue I saw with Wills manuscripts collection. Something is up with reindex again. Can you have a look at what is failing?

@hackartisan
Copy link
Member Author

According to figgy there should be 2542 complete open items in lapidus.

@hackartisan
Copy link
Member Author

hackartisan commented Apr 14, 2022

I ran Spotlight::ReindexJob.perform_later on lapidus via console and it's starting to fill up more -- currently at 590 and climbing.

hackartisan added a commit that referenced this issue Apr 15, 2022
I think this is better. Not sure whether it will fix any of these things
but done while investigating #1294, #1291, #1340

Also #961 is interesting for context
@hackartisan hackartisan self-assigned this Apr 15, 2022
@hackartisan
Copy link
Member Author

In the new version of spotlight when an index job fails (and several other types of jobs are like this, too), it is not re-queued. Instead the failure is logged and then displayed on the exhibit dashboard. The error never gets to honeybadger and the dashboard shows much less information than honeybadger does.

I think we want to make the error raise still so it goes through honeybadger.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

Successfully merging a pull request may close this issue.

2 participants