rgw: disable RGWDataChangesLog::add_entry() when log_data is off #45357

cbodley · 2022-03-11T14:52:37Z

this restores a check for RGWZone::log_data in add_entry(). with per-bucket replication, this check was replaced by a call to RGWBucketSyncPolicyHandler::bucket_exports_data()

this call has to consult two rados objects, bucket.sync-source-hints.<bucketname> and bucket.sync-target-hints.<bucketname>

but if the zone is not configured for multisite, we should avoid these extra object reads and return early

Fixes: https://tracker.ceph.com/issues/54531

Show available Jenkins commands

jenkins retest this please
jenkins test classic perf
jenkins test crimson perf
jenkins test signed
jenkins test make check
jenkins test make check arm64
jenkins test submodules
jenkins test dashboard
jenkins test dashboard cephadm
jenkins test api
jenkins test docs
jenkins render docs
jenkins test ceph-volume all
jenkins test ceph-volume tox
jenkins test windows

this restores a check for RGWZone::log_data in add_entry(). with per-bucket replication, this check was replaced by a call to `RGWBucketSyncPolicyHandler::bucket_exports_data()` this call has to consult two rados objects, `bucket.sync-source-hints.<bucketname>` and `bucket.sync-target-hints.<bucketname>` but if the zone is not configured for multisite, we should avoid these extra object reads and return early Fixes: https://tracker.ceph.com/issues/54531 Signed-off-by: Casey Bodley <cbodley@redhat.com>

cbodley · 2022-03-11T14:54:13Z

@yehudasa i know that someday we'll be able to support sync between buckets in the same zone, but for now i don't think it's necessary to consult these sync policy objects outside of multisite?

yehudasa · 2022-03-11T16:01:58Z

It is likely that you will break certain cases in bucket level sync this way. The "configured for multisite" hammer is too loosely defined and too big IMO.

cbodley · 2022-03-11T16:05:43Z

It is likely that you will break certain cases in bucket level sync this way.

can you be more specific?

The "configured for multisite" hammer is too loosely defined and too big IMO.

this particular hammer is RGWZone::log_data, which is meant to control whether we're writing to the datalog/bilogs. if that's false, then nothing in per-bucket replication should ever cause us to write datalogs - we'd accumulate data that never gets trimmed

cbodley · 2022-03-24T14:18:58Z

ping @yehudasa

yehudasa · 2022-03-24T14:29:59Z

The idea is not to need to configure sync in multiple places. How about you change that flag to a 3 state, and by default it's not defined (in which case it relies exclusively on sync policy). If true/false would mean that user explicitly had set it. In the case of sync policy, can check on init whether zone allows sync out, if it does not then can change flag to false.

cbodley · 2022-03-24T18:46:32Z

The idea is not to need to configure sync in multiple places.

thanks @yehudasa. i'm still fuzzy on sync policy, can you please check my understanding here?

we have to read these per-bucket hints because the global sync policy is 'allowed' (as opposed to 'enabled' or 'forbidden')
global sync defaults to 'allowed' in single-zone configurations

i'm guessing that 'allowed' is the default, so you can attach a new zone later without having to update this zone's policy? if that's the intent, shouldn't we at least make this 'allowed' behavior conditional on the presence of other zones?

the 'presence of other zones' is essentially all that log_data tracks. at some point, we stopped trusting our users with control over it, so now RGWZoneGroup::post_process_params() overwrites log_data = zones.size() > 1 whenever you make changes to the zonegroup. so log_data doesn't require any extra configuration, it's managed automatically when you add/remove zones

if i am understanding the intent of the 'allowed' sync policy here, then it seems to compose just fine with the log_data change in this PR

How about you change that flag to a 3 state, and by default it's not defined (in which case it relies exclusively on sync policy). If true/false would mean that user explicitly had set it. In the case of sync policy, can check on init whether zone allows sync out, if it does not then can change flag to false.

i'm not sure what you're suggesting with the third state for log_data. in this new state, what part of sync policy is it using to determine whether 'zone allows sync out'? if sync policy can already tell us that, couldn't RGWBucketSyncPolicyHandler::bucket_exports_data() use that to avoid reading these per-bucket hints? or does that decision depend on the per-bucket hints?

yehudasa · 2022-03-24T20:13:40Z

The idea is not to need to configure sync in multiple places.

thanks @yehudasa. i'm still fuzzy on sync policy, can you please check my understanding here?

we have to read these per-bucket hints because the global sync policy is 'allowed' (as opposed to 'enabled' or 'forbidden')

global sync defaults to 'allowed' in single-zone configurations

i'm guessing that 'allowed' is the default, so you can attach a new zone later without having to update this zone's policy? if that's the intent, shouldn't we at least make this 'allowed' behavior conditional on the presence of other zones?

the 'presence of other zones' is essentially all that log_data tracks. at some point, we stopped trusting our users with control over it, so now RGWZoneGroup::post_process_params() overwrites log_data = zones.size() > 1 whenever you make changes to the zonegroup. so log_data doesn't require any extra configuration, it's managed automatically when you add/remove zones

if i am understanding the intent of the 'allowed' sync policy here, then it seems to compose just fine with the log_data change in this PR

How about you change that flag to a 3 state, and by default it's not defined (in which case it relies exclusively on sync policy). If true/false would mean that user explicitly had set it. In the case of sync policy, can check on init whether zone allows sync out, if it does not then can change flag to false.

i'm not sure what you're suggesting with the third state for log_data. in this new state, what part of sync policy is it using to determine whether 'zone allows sync out'? if sync policy can already tell us that, couldn't RGWBucketSyncPolicyHandler::bucket_exports_data() use that to avoid reading these per-bucket hints? or does that decision depend on the per-bucket hints?

The decision depends on it iirc. A bucket policy by itself cannot know that there is/isn't another bucket that has a policy that fetches data from it, that's why we need the hints (that point at other buckets that may sync from us).
As a side note, iirc, one thing that the hints allow is to deal with resharded bucket as the newly sharded bucket instance can get a hint that points it to the older bucket instance.

cbodley · 2022-03-24T20:40:19Z

ok, so back to RGWDataChangesLog::add_entry(). even if there is a policy that covers writes to this bucket, why should we write to the datalog/bilog if we're the only zone in our zonegroup?

if there's no other zone attached, then nobody is going to read those logs. if another zone attaches later, it must start with a full sync and skip past these entries anyway

isn't log_data doing the right thing here?

cbodley · 2022-03-30T14:05:56Z

passed qa in https://pulpito.ceph.com/cbodley-2022-03-25_20:30:36-rgw-wip-cbodley-testing-distro-default-smithi/

cbodley · 2022-03-30T17:22:09Z

do you have any suggestions here @yehudasa?

cbodley · 2022-04-05T18:01:31Z

jenkins test make check

yuvalif · 2022-04-06T10:01:08Z

passed qa in https://pulpito.ceph.com/cbodley-2022-03-25_20:30:36-rgw-wip-cbodley-testing-distro-default-smithi/

do we have bucket sync policy tests in teuthology?

cbodley · 2022-04-06T13:41:16Z

passed qa in https://pulpito.ceph.com/cbodley-2022-03-25_20:30:36-rgw-wip-cbodley-testing-distro-default-smithi/

do we have bucket sync policy tests in teuthology?

no, i merged #31686 for octopus under the condition that tests would follow, but they didn't

yehudasa · 2022-04-13T07:30:03Z

passed qa in https://pulpito.ceph.com/cbodley-2022-03-25_20:30:36-rgw-wip-cbodley-testing-distro-default-smithi/

do we have bucket sync policy tests in teuthology?

no, i merged #31686 for octopus under the condition that tests would follow, but they didn't

Note this PR that is still pending:
#38238

yuvalif · 2022-04-13T07:50:59Z

passed qa in https://pulpito.ceph.com/cbodley-2022-03-25_20:30:36-rgw-wip-cbodley-testing-distro-default-smithi/

do we have bucket sync policy tests in teuthology?

no, i merged #31686 for octopus under the condition that tests would follow, but they didn't

Note this PR that is still pending: #38238

this is from the end of 2020 !
what is blocking that PR?

cbodley requested a review from yehudasa March 11, 2022 14:52

github-actions bot added the rgw label Mar 11, 2022

cbodley added the wip-cbodley-testing label Mar 25, 2022

cbodley requested review from adamemerson and yuvalif April 5, 2022 18:01

yuvalif approved these changes Apr 11, 2022

View reviewed changes

cbodley merged commit d104ae5 into ceph:master Apr 12, 2022

cbodley mentioned this pull request May 24, 2023

rgw: disable RGWDataChangesLog::add_entry() when log_data is off #51734

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

rgw: disable RGWDataChangesLog::add_entry() when log_data is off #45357

rgw: disable RGWDataChangesLog::add_entry() when log_data is off #45357

cbodley commented Mar 11, 2022

cbodley commented Mar 11, 2022

yehudasa commented Mar 11, 2022

cbodley commented Mar 11, 2022

cbodley commented Mar 24, 2022

yehudasa commented Mar 24, 2022

cbodley commented Mar 24, 2022

yehudasa commented Mar 24, 2022

cbodley commented Mar 24, 2022

cbodley commented Mar 30, 2022

cbodley commented Mar 30, 2022

cbodley commented Apr 5, 2022

yuvalif commented Apr 6, 2022

cbodley commented Apr 6, 2022

yehudasa commented Apr 13, 2022

yuvalif commented Apr 13, 2022

rgw: disable RGWDataChangesLog::add_entry() when log_data is off #45357

rgw: disable RGWDataChangesLog::add_entry() when log_data is off #45357

Conversation

cbodley commented Mar 11, 2022

cbodley commented Mar 11, 2022

yehudasa commented Mar 11, 2022

cbodley commented Mar 11, 2022

cbodley commented Mar 24, 2022

yehudasa commented Mar 24, 2022

cbodley commented Mar 24, 2022

yehudasa commented Mar 24, 2022

cbodley commented Mar 24, 2022

cbodley commented Mar 30, 2022

cbodley commented Mar 30, 2022

cbodley commented Apr 5, 2022

yuvalif commented Apr 6, 2022

cbodley commented Apr 6, 2022

yehudasa commented Apr 13, 2022

yuvalif commented Apr 13, 2022