Too many values for UnInvertedField faceting on field object_closure #44

Closed
kshefchek opened this issue Mar 13, 2019 · 10 comments

Comments

@kshefchek
Contributor

Solr-dev is currently returning an exception for any biolink call that facets over object_closures:
java.lang.IllegalStateException: Too many values for UnInvertedField faceting on field object_closure

We appear to have hit some limit, but it's not obvious which one from looking at the data:
Solr Production
Total Docs: 37502996
Unique values in object_closure: 4005165

Solr Dev:
Total Docs: 38759008
Unique values in object_closure: 4350271

It's also possible the limit is based on the number of values in a single document, but this is harder to gather without iterating over each document.
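
One way to gather the per-document counts is to page through the index and tally the length of the field in each document. This is a minimal sketch, not something tried in this thread. It assumes the core is reachable over HTTP at a hypothetical `select_url`, that the unique key field is named `id`, that `object_closure` is a stored field, and that the Solr version in use supports `cursorMark` paging:

```python
import json
import urllib.parse
import urllib.request


def iter_docs(select_url, field="object_closure", rows=10000):
    """Yield documents page by page using Solr cursorMark paging.

    select_url is hypothetical, e.g. "http://localhost:8983/solr/CORE/select".
    Assumes the uniqueKey field is "id" (cursor paging needs a sort on it).
    """
    cursor = "*"
    while True:
        params = urllib.parse.urlencode({
            "q": "*:*",
            "fl": field,
            "rows": rows,
            "sort": "id asc",
            "cursorMark": cursor,
            "wt": "json",
        })
        with urllib.request.urlopen(f"{select_url}?{params}") as resp:
            body = json.load(resp)
        yield from body["response"]["docs"]
        next_cursor = body["nextCursorMark"]
        if next_cursor == cursor:  # the cursor stops advancing at the end
            break
        cursor = next_cursor


def value_count_stats(docs, field="object_closure"):
    """Return sum, max, and mean of the number of values per document."""
    total = largest = n_docs = 0
    for doc in docs:
        n = len(doc.get(field, []))  # documents without the field count as 0
        total += n
        largest = max(largest, n)
        n_docs += 1
    mean = total / n_docs if n_docs else 0.0
    return {"sum": total, "max": largest, "mean": mean}
```

Calling `value_count_stats(iter_docs(select_url))` would walk the whole index once, so on roughly 38 million documents it is likely to take a while.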

Related:
https://issues.apache.org/jira/browse/SOLR-11240

Possible solutions:

cc @kltm @DoctorBud @deepakunni3

@cmungall
Contributor

4m is more than I would have expected. I guess this is because genes, variants, etc. make their way into the closure?

@kshefchek
Contributor Author

It does seem high. I could put the values in a file so we can debug.

@kshefchek
Contributor Author

Another solution is to only close over equivalentClass for entities such as genes, variants, etc.

@kltm

kltm commented Mar 13, 2019 via email

@kshefchek
Contributor Author

This occurred after a new data load, and we've tried increasing the memory.

some more stats on the object_closure field:
sum: 819945513
max: 648
mean: 21.15496642741734
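
As a consistency check on these figures (an inference, not something stated in the thread): the reported mean equals the sum divided by the Solr Dev document count, which suggests the stats describe the number of `object_closure` values per document. A short sketch using only numbers quoted above:

```python
# Figures copied from this thread.
value_sum = 819945513              # "sum" reported for object_closure
reported_mean = 21.15496642741734  # "mean" reported for object_closure
dev_total_docs = 38759008          # Solr Dev "Total Docs"

implied_mean = value_sum / dev_total_docs
matches = abs(implied_mean - reported_mean) < 1e-6
print(implied_mean, matches)
```

If that reading is right, the largest single document carries 648 values.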

All values:
object_closure.txt.gz

solr-admin

Nothing I can see in the logs when the server is starting up.

@kshefchek
Contributor Author

Here is some more info after reindexing:

  1. Indexed on monarch5, moved files to solr-dev
     • facets on object_closure work
  2. Optimized index
     • Exception when faceting on object_closure

So, oddly, this is related to optimizing on the solr-dev VM rather than on monarch5. I changed where the optimize runs because we were running out of disk space on monarch5.
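
To confirm whether faceting works after each step, a minimal facet request against the field is enough to trigger or rule out the exception. The sketch below only builds the request URL; the host and core name are hypothetical placeholders, and the parameters are the standard Solr faceting ones rather than the exact query biolink sends:

```python
import urllib.parse


def facet_check_url(select_url, field="object_closure", limit=5):
    """Build a minimal facet request on `field` that returns no documents."""
    params = {
        "q": "*:*",
        "rows": 0,               # only the facet counts are of interest
        "facet": "true",
        "facet.field": field,
        "facet.limit": limit,
        "wt": "json",
    }
    return f"{select_url}?{urllib.parse.urlencode(params)}"


# Hypothetical endpoint; substitute the real host and core name.
print(facet_check_url("http://localhost:8983/solr/CORE/select"))
```

Running the resulting URL once after the file move and once after the optimize would show at which step the exception first appears.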

@kshefchek
Contributor Author

I'm uncertain of the cause of this, but I at least understand how to avoid it. I propose we close this since it's no longer an issue.

@kshefchek
Contributor Author

This is rearing its head again, although not for every load, and it seems re-indexing fixes it. I will play around with the memory config.

@cmungall
Contributor

cmungall commented Dec 31, 2020 via email

@kshefchek
Contributor Author

I have to look back through my notes to see if I tried docValues. This has also been fixed in Solr 7 and above, so that is another reason to push for an update: https://issues.apache.org/jira/browse/SOLR-11240
