Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

RPMs matching re-implemented [RHELDST-6392] #158

Merged
merged 1 commit into from
Aug 26, 2021

Conversation

rbikar
Copy link
Member

@rbikar rbikar commented Aug 24, 2021

This rather bigger change moves matching non-modular binary,
debug and source packages to separate class.

This change provides 1:1 functionality as previous code, but is more
robust, reliable and uses better paralellizition and should be also
a bit faster than the previous implementation.

I also removed some unused code, added and improved tests, added
some custom config to pylint in order to pass static checks.

@rbikar rbikar force-pushed the match-rpms-effectively branch 2 times, most recently from b0a9f9e to 6cdfc1d Compare August 25, 2021 09:38
This rather bigger change moves matching non-modular binary,
debug and source packages to separate class.

This change provides 1:1 functionality as previous code, but is more
robust, reliable and uses better paralellizition and should be also
a bit faster than the previous implementation.

I also removed some unused code, added and improved tests, added
some custom config to pylint in order to pass static checks.
@rbikar
Copy link
Member Author

rbikar commented Aug 25, 2021

run tests

@rbikar rbikar marked this pull request as ready for review August 25, 2021 10:54
@rbikar
Copy link
Member Author

rbikar commented Aug 25, 2021

Failure in integration tests is a known issue: RHELDST-7382. It's missing line in logs, I asked for disabling the check.


BATCH_SIZE = int(os.getenv("UBIPOP_BATCH_SIZE", "250"))
# need to set significantly lower batches for general rpm search
# otherwise db may very likely hit OOM error.
BATCH_SIZE_RPM = int(os.getenv("UBIPOP_BATCH_SIZE_RPM", "15"))
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

This value is surprisingly low. If it really has to be this low to succeed, I wonder if there's something else going on here like queries being unexpectedly inefficient.

Copy link
Member Author

@rbikar rbikar Aug 26, 2021

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Well, first I tested it with the batch size 250 and queries for RPMs immediately caused OOM on DB and outage on stage rhsm-pulp. Value 25 worked OK, 50 caused OOM as well. So I rather went to safer value 15.

@rbikar rbikar merged commit 1502e8d into release-engineering:master Aug 26, 2021
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

None yet

3 participants