dpul:reindex:collections does not fully index collections #1294

hackartisan · 2022-02-03T13:58:26Z

When the reindex is run in bulk via this rake task collections do not get fully indexed. For example, when run recently on dpul-staging the shakespeareandco collection had 548 items after all the indexing was done. On prod it has 787.

This bug has been observed now in both prod and staging.

eastasian also indexed with many fewer items: 247 out of 441. However, sae_sri_lanka_dissidents and piranesi indexed completely.

When run individually via the console, shakespeareandco indexed all records.

kelea99 · 2022-04-14T02:17:34Z

@hackartisan Steve F. Just reported the Lapidus collection went down to only 336 items from over a thousand. It is the same issue I saw with Wills manuscripts collection. Something is up with reindex again. Can you have a look at what is failing?

hackartisan · 2022-04-14T13:37:52Z

According to figgy there should be 2542 complete open items in lapidus.

hackartisan · 2022-04-14T13:43:26Z

I ran Spotlight::ReindexJob.perform_later on lapidus via console and it's starting to fill up more -- currently at 590 and climbing.

I think this is better. Not sure whether it will fix any of these things but done while investigating #1294, #1291, #1340 Also #961 is interesting for context

hackartisan · 2022-05-09T18:24:51Z

In the new version of spotlight when an index job fails (and several other types of jobs are like this, too), it is not re-queued. Instead the failure is logged and then displayed on the exhibit dashboard. The error never gets to honeybadger and the dashboard shows much less information than honeybadger does.

I think we want to make the error raise still so it goes through honeybadger.

and sidekiq can retry closes #1294

hackartisan added the maintenance / research label Feb 3, 2022

hackartisan mentioned this issue Feb 3, 2022

Upgrade to Solr 8 #822

Closed

12 tasks

hackartisan mentioned this issue Apr 15, 2022

Use a Pipeline that doesn't load into solr when just fetching documents #1341

Merged

hackartisan self-assigned this Apr 15, 2022

hackartisan mentioned this issue Apr 15, 2022

Error message appeared in reindexing status for Slavic Collections in Dashboard #1340

Closed

3 tasks

hackartisan added a commit that referenced this issue May 9, 2022

Make ReindexJob raise so honeybadger can notify

24af224

and sidekiq can retry closes #1294

hackartisan mentioned this issue May 9, 2022

Make ReindexJob raise so honeybadger can notify #1346

Merged

hackartisan added a commit that referenced this issue May 9, 2022

Make ReindexJob raise so honeybadger can notify

f262ef8

and sidekiq can retry closes #1294

tpendragon closed this as completed in #1346 May 10, 2022

hackartisan mentioned this issue May 11, 2022

Actually raise so we can actually get honeybadger notifications #1349

Merged

kelea99 mentioned this issue Dec 1, 2023

dpul:reindex:collections does not appear to fully reindex collections #1491

Closed

3 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

dpul:reindex:collections does not fully index collections #1294

dpul:reindex:collections does not fully index collections #1294

hackartisan commented Feb 3, 2022 •

edited

Loading

kelea99 commented Apr 14, 2022

hackartisan commented Apr 14, 2022

hackartisan commented Apr 14, 2022 •

edited

Loading

hackartisan commented May 9, 2022

dpul:reindex:collections does not fully index collections #1294

dpul:reindex:collections does not fully index collections #1294

Comments

hackartisan commented Feb 3, 2022 • edited Loading

kelea99 commented Apr 14, 2022

hackartisan commented Apr 14, 2022

hackartisan commented Apr 14, 2022 • edited Loading

hackartisan commented May 9, 2022

hackartisan commented Feb 3, 2022 •

edited

Loading

hackartisan commented Apr 14, 2022 •

edited

Loading