You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
It's unclear how large a file Solr can ingest without problems (the 20G Gene.txt file appears to work), but the 100+GB Protein.txt file definitely doesn't. We should split it into smaller files for the ingest. The best place to do this is to include it in the Makefile.
The text was updated successfully, but these errors were encountered:
This can be done with split using some variant of split -d -l 10000000 Protein.txt Protein.txt.. Remember to delete the original Protein.txt once you're done to save on disk space.
It's unclear how large a file Solr can ingest without problems (the 20G Gene.txt file appears to work), but the 100+GB Protein.txt file definitely doesn't. We should split it into smaller files for the ingest. The best place to do this is to include it in the Makefile.
The text was updated successfully, but these errors were encountered: