cls/rgw: index cancelation still cleans up remove_objs #43854

cbodley · 2021-11-09T03:33:14Z

when multipart uploads complete their final bucket index transaction, they pass the list of part objects in 'remove_objs' for bulk removal - the part objects, along with their bucket stats, get replaced by the head object

but if CompleteMultipart races with another upload, the head object write will fail with ECANCELED and the bucket index transaction gets canceled with CLS_RGW_OP_CANCEL. these canceled uploads still need to clean up their 'remove_objs', but cancelation was returning too early. as a result, these bucket index entries get orphaned and leave the bucket stats inconsistent

this commit reworks rgw_bucket_complete_op() so that CLS_RGW_OP_CANCEL is handled the same way as OP_ADD and OP_DEL, so it always runs the loop to clean up 'remove_objs'

Fixes: https://tracker.ceph.com/issues/53199
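The control flow described above can be sketched with a toy in-memory model. This is not the code in src/cls/rgw/cls_rgw.cc: `BucketIndex`, `Entry`, `Op`, `add_entry()`, `remove_entry()` and `complete_op()` are hypothetical stand-ins, and the real `rgw_bucket_complete_op()` works on omap entries and per-category stats. The sketch only illustrates the idea that a canceled op leaves the head entry alone but still falls through to the 'remove_objs' loop.

```cpp
#include <cstdint>
#include <map>
#include <string>
#include <vector>

// Hypothetical, simplified stand-ins for the real cls_rgw types.
enum class Op { Add, Del, Cancel };

struct Entry {
  std::uint64_t size;
};

struct BucketIndex {
  std::map<std::string, Entry> entries;
  std::uint64_t num_objects = 0;
  std::uint64_t total_size = 0;
};

// Insert or overwrite an entry and keep the stats in sync.
void add_entry(BucketIndex& index, const std::string& key, std::uint64_t size) {
  auto it = index.entries.find(key);
  if (it == index.entries.end()) {
    index.entries[key] = Entry{size};
    index.num_objects++;
  } else {
    index.total_size -= it->second.size;
    it->second.size = size;
  }
  index.total_size += size;
}

// Drop an entry (if present) and subtract it from the stats.
void remove_entry(BucketIndex& index, const std::string& key) {
  auto it = index.entries.find(key);
  if (it == index.entries.end()) {
    return;
  }
  index.num_objects--;
  index.total_size -= it->second.size;
  index.entries.erase(it);
}

// Toy model of the reworked completion path: Cancel no longer returns
// early, so the cleanup loop below runs for every op type.
void complete_op(BucketIndex& index, Op op, const std::string& key,
                 std::uint64_t size,
                 const std::vector<std::string>& remove_objs) {
  switch (op) {
  case Op::Add:
    add_entry(index, key, size);
    break;
  case Op::Del:
    remove_entry(index, key);
    break;
  case Op::Cancel:
    // The head entry is left untouched. In this model, returning here
    // would leave the part entries behind.
    break;
  }
  for (const std::string& part : remove_objs) {
    remove_entry(index, part);
  }
}
```

In this model, a canceled completion that passes its two part keys in `remove_objs` leaves only the head entry written by the racing upload, so the object count and total size stay consistent.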

Checklist

References tracker ticket
Updates documentation if necessary
Includes tests for new functionality or reproducer for bug

cbodley · 2021-11-09T03:37:49Z

with this fix, the bucket stats correctly show a single 32M object using the reproducer from https://tracker.ceph.com/issues/53199, even with 36 concurrent multipart uploads (30 of which were canceled):

    "usage": {
        "rgw.main": {
            "size": 33554432,
            "size_actual": 33554432,
            "size_utilized": 33554432,
            "size_kb": 32768,
            "size_kb_actual": 32768,
            "size_kb_utilized": 32768,
            "num_objects": 1
        },
        "rgw.multimeta": {
            "size": 0,
            "size_actual": 0,
            "size_utilized": 0,
            "size_kb": 0,
            "size_kb_actual": 0,
            "size_kb_utilized": 0,
            "num_objects": 0
        }
    }

ivancich

nice!

src/cls/rgw/cls_rgw.cc

ivancich

oops, my previous review was not the approve that I intended

cbodley · 2021-11-09T15:19:04Z

thanks for the review @ivancich! i'm still planning to model multipart uploads in #43843, so that should give us good test coverage for this

github-actions · 2021-11-12T23:06:58Z

This pull request can no longer be automatically merged: a rebase is needed and changes have to be manually resolved

cbodley · 2021-11-15T18:30:27Z

rebased over #43103

cbodley · 2021-11-15T20:43:35Z

jenkins test api

cbodley · 2021-11-16T01:23:16Z

jenkins test api

cbodley added bug-fix rgw labels Nov 9, 2021

cbodley requested a review from ivancich November 9, 2021 03:33

ivancich reviewed Nov 9, 2021

ivancich self-requested a review November 9, 2021 15:13

ivancich approved these changes Nov 9, 2021

cbodley force-pushed the wip-53199 branch from 49a6e42 to e357f4e Compare November 9, 2021 15:54

github-actions bot added the needs-rebase label Nov 12, 2021

cbodley added 4 commits November 15, 2021 12:34

cls/rgw: helpers take const input params

d7ec0b2

Signed-off-by: Casey Bodley <cbodley@redhat.com>

cls/rgw: add complete_remove_obj() helper for remove_objs

f3325fc

Signed-off-by: Casey Bodley <cbodley@redhat.com>

rgw/rados: index transactions pass remove_objs to cancel() too

b848cca

whenever an index transaction uses remove_objs for complete(), it also needs to pass them for cancel() to avoid leaking index entries

Signed-off-by: Casey Bodley <cbodley@redhat.com>
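The caller-side rule in this last commit message can be sketched as follows. All names here (`IndexTxn`, `complete_multipart()`, the `write_head` callback) are hypothetical and do not reflect the rgw/rados interfaces touched by the commit. The sketch assumes only what the message states: the same list of part keys is handed to the index transaction on both the success path and the cancel path.

```cpp
#include <cerrno>
#include <functional>
#include <string>
#include <vector>

// Hypothetical index transaction that records which keys it was asked
// to clean up; a real transaction would send them to the bucket index.
struct IndexTxn {
  std::vector<std::string> removed;
  bool completed = false;
  bool canceled = false;

  void complete(const std::vector<std::string>& remove_objs) {
    completed = true;
    removed = remove_objs;
  }

  void cancel(const std::vector<std::string>& remove_objs) {
    canceled = true;
    removed = remove_objs;
  }
};

// write_head returns 0 on success or a negative errno such as -ECANCELED
// when another upload wins the race for the head object.
int complete_multipart(IndexTxn& txn,
                       const std::vector<std::string>& parts,
                       const std::function<int()>& write_head) {
  int r = write_head();
  if (r < 0) {
    // Pass the same part list on cancel so no index entries are leaked.
    txn.cancel(parts);
    return r;
  }
  txn.complete(parts);
  return 0;
}
```

If `write_head` returns `-ECANCELED`, the transaction is canceled but still receives both part keys, mirroring what `complete()` would have been given.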

cbodley force-pushed the wip-53199 branch from e357f4e to b848cca Compare November 15, 2021 18:27

github-actions bot removed the needs-rebase label Nov 15, 2021

cbodley merged commit 696968a into ceph:master Nov 16, 2021

cbodley deleted the wip-53199 branch November 16, 2021 15:28

cbodley mentioned this pull request Feb 23, 2023

pacific: cls/rgw: remove index entry after cancelling last racing delete op #50243

Merged