upgrade to Solr 9.2 #1359

rlskoeser · 2023-04-19T13:01:04Z

dev notes

depends on test parasolr against solr 9.2 parasolr#80
test geniza against solr 9 in CI on an upgrade branch
update geniza qa ansible playbook for solr 9 staging hosts
test geniza qa on solr 9 thoroughly
when successful, update geniza production playbook for solr 9

- also bump isort in pre-commit hook - also update comment for unit test workflow

(#1359)

blms · 2023-09-05T17:37:33Z

results from testing:

The search seems to work fine, in some cases performing better than the prod site even without the additional feature. (For example, this search is highlighted properly in the test site but not in production.)
I did get one strange outcome for shelfmark scoped search, where I think it may be evaluating type=edismax directly again after two correct results… the default parser issue coming up again somehow? (here’s that search on prod.)

blms · 2023-09-14T18:51:23Z

@rlskoeser I think I see what's going on with shelfmark scoped search: the search is being evaluated as

'keyword_query': '{!type=edismax qf=$shelfmark_qf}"T-S 8J16.25"'

which would be inserted into

'q': '{!type=edismax qf=$keyword_qf pf=$keyword_pf v=$keyword_query}'

in place of $keyword_query, producing a sort of nested or wrapped query, with two type=edismax and two qfs in different parts of the query.

It seems like this kind of nesting should work according to the docs (under "boost" here), and it does work for the first couple of results, but it then begins to evaluate type as a part of the keyword query. Maybe this kind of wrapping only works with the boost parser and not edismax?

FWIW, I also noticed that on production, that search seems to produce identical results regardless of whether you include shelfmark:, up until about result number ~375, at which point the results are so barely relevant that it seems inconsequential:

Despite that, I would say in both cases this actually performs better than the same search on QA, because it finds other shelfmarks that are similar before producing irrelevant results. Seems like it's mostly just using the boosted shelfmark fields as in a normal keyword query.

Again, odd, as I can't find any documented solr changes that would cause this difference, and the query is exactly the same in the current prod code.

Also, as for your question on Slack about whether this happens for other scoped searches—it does not, as this is the only scoped search where we actually use this kind of syntax. I think we do it because shelfmark_qf refers to multiple fields so we need to use qf. In all the other scoped searches we just replace the term with its solr field name, which is always just a single field.

rlskoeser · 2023-10-11T15:46:49Z

solr upgrade complete!

rlskoeser added the 🛠️ chore One-off task or update label Apr 19, 2023

blms self-assigned this Jun 26, 2023

blms added a commit that referenced this issue Jun 26, 2023

Upgrade solr to 9.2 (#1359)

a1edf99

blms added a commit that referenced this issue Jun 26, 2023

Upgrade parasolr to use develop branch (#1359)

71a282e

blms added a commit that referenced this issue Jun 26, 2023

Add deploy notes for solr upgrade (#1359)

c90a534

- also bump isort in pre-commit hook - also update comment for unit test workflow

blms added a commit that referenced this issue Jun 26, 2023

Upgrade solr config to use new modules directories

c97bb76

(#1359)

blms added a commit that referenced this issue Jun 26, 2023

Remove references to outdated solr cache classes

f5c2085

(#1359)

blms added a commit that referenced this issue Jun 26, 2023

Use new namespace for XSLTResponseWriter (#1359)

fa149ca

blms added a commit that referenced this issue Jul 11, 2023

Remove deprecated velocity jar (#1359)

1814cc7

blms added a commit that referenced this issue Jul 21, 2023

Revert query changes, set default parser to lucene

5dc6436

(#1359)

rlskoeser closed this as completed Oct 11, 2023

rlskoeser self-assigned this Oct 11, 2023

blms mentioned this issue Oct 18, 2023

Issues with shelfmark scoped search #1476

Closed

3 tasks

rlskoeser mentioned this issue Jan 16, 2024

upgrade to Solr 9 Princeton-CDH/ppa-django#572

Closed

14 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

upgrade to Solr 9.2 #1359

upgrade to Solr 9.2 #1359

rlskoeser commented Apr 19, 2023 •

edited

blms commented Sep 5, 2023

blms commented Sep 14, 2023 •

edited

rlskoeser commented Oct 11, 2023

upgrade to Solr 9.2 #1359

upgrade to Solr 9.2 #1359

Comments

rlskoeser commented Apr 19, 2023 • edited

dev notes

blms commented Sep 5, 2023

blms commented Sep 14, 2023 • edited

rlskoeser commented Oct 11, 2023

rlskoeser commented Apr 19, 2023 •

edited

blms commented Sep 14, 2023 •

edited