Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Optimise Contract Sizes #773

Merged
merged 3 commits into from
Nov 30, 2023
Merged

Optimise Contract Sizes #773

merged 3 commits into from
Nov 30, 2023

Conversation

peterjan
Copy link
Member

@peterjan peterjan commented Nov 29, 2023

This prunable data query is rather slow because we have to LEFT JOIN contract_sectors. We need that because we want to include contracts without sectors too. I figured in this case it makes sense to split one query into two: one where we use WHERE NOT EXISTS to catch everything we would miss out on if we were to INNER JOIN.

In the contract size query (so fetching size details for a single contract) I kept the LEFT JOIN since it's bound by the fcid anyway. I considered getting rid of it all together because we only use it in the worker to escape early. We could also return a custom error and return a 400 when the user supplies a contract to prune but there's no sectors to prune. This would remove a bus route though (/contract/:id/size) and I wasn't sure whether that was a good or a bad thing.. might be usable at one point but otoh less routes less trouble... cc @ChrisSchinnerl wdyt.

NOTE: I didn't have to alter the tests because they broke when switching to an INNER JOIN

@peterjan peterjan self-assigned this Nov 29, 2023
@peterjan peterjan marked this pull request as ready for review November 29, 2023 16:17
var nullContracts []size
var dataContracts []size
if err := s.db.Transaction(func(tx *gorm.DB) error {
// use WHERE NOT EXISTS to catch contracts that would otherwise have been caught by a LEFT JOIN
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

it would be bit more helpful to explain what this query does rather than what alternative we had for it. Someone who looks at this in a week probably won't remember what the left join was about.

so e.g. "fetch all contracts that don't have any useful sectors in them anymore"

Copy link
Member Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

haha yeah, fair enough

`, rhpv2.SectorSize, rhpv2.SectorSize).
Scan(&rows).
Error; err != nil {
var nullContracts []size
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

null requires some insider knowledge on the query we had before. I'd recommend something like unusedContracts and usedContracts to indicate that it's just contracts without sectors.

Copy link
Member Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I updated the comments on the query to better explain what's happening. I'd like to keep nullContract because for me it's the only name that really makes sense. I don't like unusedContracts or emptyContracts because if they're unused or empty why would you be interested in their size field for pruning...

LEFT JOIN contract_sectors cs ON cs.db_contract_id = c.id
WHERE c.fcid = ?
`, rhpv2.SectorSize, rhpv2.SectorSize, fileContractID(id)).
SELECT contract_size as size, CASE WHEN contract_size > sector_size THEN contract_size - sector_size ELSE 0 END as prunable FROM (
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Is this new query faster? Or did you just refactor it?

Copy link
Member Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

It's marginally faster. It avoids doing MAX and COUNT multiple times. I never really cared b/c I figured the database engine would optimise that for me.

@ChrisSchinnerl ChrisSchinnerl merged commit f3fca98 into master Nov 30, 2023
6 checks passed
@ChrisSchinnerl ChrisSchinnerl deleted the pj/contract-sizes branch November 30, 2023 16:55
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
Archived in project
Development

Successfully merging this pull request may close these issues.

None yet

2 participants