RGW: support for tagging in lifecycle policies #17305

theanalyst · 2017-08-28T13:29:04Z

This PR adds support for tagging in object lifecycle policies, if a lifecycle policy has a tag attribute, then while processing objects in the bucket we additionally retrieve the objects tags to check whether all the tags in LC policy are contained in the object's tags.

Fix the mtime issue with tagging (rgw: rgw_rados: set_attrs now sets the same time for BI & object #17400)
Refactor list objects within LC to already filter out based on params & tagging (so that ops like mp_expire do not need to C-c C-v the same code) - I'll do this when we tackle expiration and tagging
Add S3 tests (for functionality, we first need to fix the timing of LC ops itself) (add tests for lifecycle with tagging s3-tests#188)

theanalyst · 2017-08-28T13:31:23Z

src/rgw/rgw_lc.cc

            int ret = store->get_obj_state(&rctx, bucket_info, obj, &state, false);
            if (ret < 0) {
              return ret;
            }
-            if (state->mtime != obj_iter->meta.mtime)//Check mtime again to avoid delete a recently update object as much as possible
+            if (state->mtime != obj_iter->meta.mtime) {


Currently this seems to be valid if an object is modified via say a put_object_tagging operation, in which case the mtime from state is older than the one in bucket index. And this will end up with a case of the object never getting deleted

For eg:

skipping removal: state->mtime 2017-08-28 16:17:44.0.565582s obj->mtime 2017-08-28 16:17:44.0.574707s

will look into this

theanalyst · 2017-08-31T12:04:32Z

created #17400 for the mtime mismatch

dang

This looks good. When you feel the mtime issue is fixed, remove the DNM.

Useful when populating tags from XML, empty() is useful to determine the existence of tags Signed-off-by: Abhishek Lekshmanan <abhishek@suse.com>

Signed-off-by: Abhishek Lekshmanan <abhishek@suse.com>

- rgw_lc_s3: + LCFilter class now has an obj_tags member which shall store the object tags, the related methods like empty() is updated to account for object tags as well + LCFilter_S3: parsing now accounts for the <And> xml tag which specifies if multiple conditions are allowed, since we parse tags as an iterator, we keep a count of the tags and actually validate that the And tag was supplied if we see multiple tags. - rgw_lc: add support for tagging + lc_op has obj_tags implemented as a boost::optional, which gets populated only when rule has tags + we use std::includes to compare that all the tags in policy are a part of the object. We only read object tags if tags are a part of the LC rule - rgw_lc: have consistent log message styles when using __func__ Signed-off-by: Abhishek Lekshmanan <abhishek@suse.com>

theanalyst · 2017-09-20T15:35:22Z

changelog: changed from return to continue if read_obj_tags fails, as this would fail with an ENODATA if there are no tags, either way if the reading xattr failed when the supplied LC policy has a tagging conditional, it would make sense to continue processing the other objects.

theanalyst · 2017-09-20T15:38:41Z

@dang added s3 tests, removed the DNM label, can you take a look again

theanalyst · 2017-10-17T09:34:21Z

@dang ran rgw suite on this pr with
http://pulpito.ceph.com/abhi-2017-10-12_13:15:36-rgw-wip-rgw-lc-tagging-distro-basic-smithi/ & rerun of failures
with http://pulpito.ceph.com/abhi-2017-10-13_14:29:33-rgw-wip-rgw-lc-tagging-distro-basic-smithi/ the tests are green except for one failure seen in LC delete marker expiration in the first run which is a timing based test. (passed on rerun) the other valgrind and multisite failures seem to be knon failures, is this good to go?

dang · 2017-10-23T13:30:51Z

LGTM

yuriw · 2017-10-23T15:20:11Z

wip-yuri2-testing-2017-10-23-1517

theanalyst commented Aug 28, 2017

View reviewed changes

theanalyst requested review from adamemerson, dang and yehudasa August 28, 2017 13:31

theanalyst added DNM feature rgw labels Aug 28, 2017

adamemerson approved these changes Aug 30, 2017

View reviewed changes

dang approved these changes Sep 1, 2017

View reviewed changes

theanalyst mentioned this pull request Sep 11, 2017

rgw: drop misc unused set_attr #17629

Closed

theanalyst added 3 commits September 19, 2017 11:24

rgw_tag: implement emplace, empty & clear methods

39e54da

Useful when populating tags from XML, empty() is useful to determine the existence of tags Signed-off-by: Abhishek Lekshmanan <abhishek@suse.com>

rgw: rgw_tag_s3: make dump_xml a const member function

aba6427

Signed-off-by: Abhishek Lekshmanan <abhishek@suse.com>

theanalyst force-pushed the rgw-lc-tagging branch from 61c77cd to d3100dd Compare September 20, 2017 15:32

theanalyst removed the DNM label Sep 20, 2017

theanalyst changed the title ~~DNM: RGW: support for tagging in lifecycle policies~~ RGW: support for tagging in lifecycle policies Sep 20, 2017

dang approved these changes Sep 28, 2017

View reviewed changes

theanalyst added the needs-qa label Oct 23, 2017

yuriw added the wip-yuri2-testing label Oct 23, 2017

yuriw merged commit 33191d2 into ceph:master Oct 23, 2017

theanalyst mentioned this pull request Jan 9, 2018

add tests for lifecycle with tagging ceph/s3-tests#188

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

RGW: support for tagging in lifecycle policies #17305

RGW: support for tagging in lifecycle policies #17305

theanalyst commented Aug 28, 2017 •

edited

theanalyst Aug 28, 2017

theanalyst Aug 28, 2017 •

edited

theanalyst commented Aug 31, 2017

dang left a comment

theanalyst commented Sep 20, 2017

theanalyst commented Sep 20, 2017

theanalyst commented Oct 17, 2017

dang commented Oct 23, 2017

yuriw commented Oct 23, 2017

RGW: support for tagging in lifecycle policies #17305

RGW: support for tagging in lifecycle policies #17305

Conversation

theanalyst commented Aug 28, 2017 • edited

theanalyst Aug 28, 2017

Choose a reason for hiding this comment

theanalyst Aug 28, 2017 • edited

Choose a reason for hiding this comment

theanalyst commented Aug 31, 2017

dang left a comment

Choose a reason for hiding this comment

theanalyst commented Sep 20, 2017

theanalyst commented Sep 20, 2017

theanalyst commented Oct 17, 2017

dang commented Oct 23, 2017

yuriw commented Oct 23, 2017

theanalyst commented Aug 28, 2017 •

edited

theanalyst Aug 28, 2017 •

edited