[DNM]Wip rgw d4n next #56336

Draft
wants to merge 32 commits into main

Conversation

@pritha-srivastava (Contributor) commented Mar 20, 2024

This PR contains the next set of changes for the d4n filter driver.

So far, this PR achieves a working single-node write-back cache for non-multipart (small) objects, both versioned and non-versioned.

To test the various scenarios, bring up a vstart cluster as follows:
MON=1 OSD=1 RGW=1 MGR=0 MDS=0 ../src/vstart.sh -n -d -o rgw_d4n_l1_datacache_persistent_path=/home/prsrivas/ceph/build/rgw_d4n_datacache/ -o rgw_d4n_l1_datacache_size=5368709120 -o rgw_filter=d4n -o d4n_writecache_enabled=true -o rgw_d4n_cache_cleaning_interval=600
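
Before running the steps below, it may help to confirm that the gateway is up and the cache directory starts out empty; a minimal check, assuming the endpoint and cache path from the vstart command above:
    curl -s http://localhost:8000    # any S3 XML response means the gateway is answering
    ls -l rgw_d4n_datacache/         # should be empty before the first upload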

The following steps are for uploading and downloading an object to/from a non-versioned bucket:

  1. Create a bucket:
    aws s3 mb s3://my-new-bucket --endpoint-url http://localhost:8000 --region us-east-1

  2. Upload a small (non-multipart) object:
    aws s3 cp ./1M s3://my-new-bucket --endpoint-url http://localhost:8000 --region us-east-1

  3. Check the d4n datacache contents - you will see one entry for the head object and other entries for the object's data:
    ls -l rgw_d4n_datacache/
    -rw-r--r--. 1 prsrivas prsrivas 0 May 2 12:43 D_my-new-bucket_09v6V6FcjyJ17hNJmFgysOjxtLAAX2n_1M
    -rw-r--r--. 1 prsrivas prsrivas 1048576 May 2 12:43 D_my-new-bucket_09v6V6FcjyJ17hNJmFgysOjxtLAAX2n_1M_0_1048576

  4. Check if get-object works:
    aws s3api get-object --bucket my-new-bucket --key 1M --endpoint-url http://localhost:8000 --region us-east-1 ./1M-out

  5. Check md5 of the objects, to ensure that the contents are as expected:
    md5sum ./1M ./1M-out

  6. Now wait for the cleaning process to kick in (roughly after rgw_d4n_cache_cleaning_interval, which is 600 seconds here; see the sketch after this list)

  7. Check the d4n datacache contents - all dirty entries have now been marked clean (the D_ prefix is removed), which means they have been written to the backend store:
    -rw-r--r--. 1 prsrivas prsrivas 0 May 2 12:43 my-new-bucket_09v6V6FcjyJ17hNJmFgysOjxtLAAX2n_1M
    -rw-r--r--. 1 prsrivas prsrivas 1048576 May 2 12:43 my-new-bucket_09v6V6FcjyJ17hNJmFgysOjxtLAAX2n_1M_0_1048576

  8. Check get-object now
    aws s3api get-object --bucket my-new-bucket --key 1M --endpoint-url http://localhost:8000 --region us-east-1 ./1M-cache

  9. Check md5 of the objects, to ensure that the contents are as expected:
    md5sum ./1M ./1M-cache
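
The wait in step 6 can be scripted; a minimal sketch covering steps 6-9, assuming the cache path and the 600-second cleaning interval configured above:
    # poll until the cleaner has flushed all dirty (D_-prefixed) entries
    while ls rgw_d4n_datacache/ | grep -q '^D_'; do sleep 30; done
    ls -l rgw_d4n_datacache/
    aws s3api get-object --bucket my-new-bucket --key 1M --endpoint-url http://localhost:8000 --region us-east-1 ./1M-cache
    md5sum ./1M ./1M-cache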

The following steps are for uploading/downloading an object to/from a versioned bucket:

  1. Create a bucket:
    aws s3 mb s3://my-new-bucket --endpoint-url http://localhost:8000 --region us-east-1

  2. Enable versioning on the bucket:
    aws s3api put-bucket-versioning --bucket my-new-bucket --versioning-configuration Status=Enabled --endpoint-url http://localhost:8000 --region us-east-1

  3. Upload an object:
    aws s3 cp ./1M s3://my-new-bucket --endpoint-url http://localhost:8000 --region us-east-1

  4. Check d4n datacache contents for the head and data entries

  5. Check get-object without specifying version-id
    aws s3api get-object --bucket my-new-bucket --key 1M --endpoint-url http://localhost:8000 --region us-east-1 ./1M-out

  6. Check get-object by specifying a version-id (see the sketch after this list for one way to obtain it)
    aws s3api get-object --bucket my-new-bucket --key 1M --version-id "09v6V6FcjyJ17hNJmFgysOjxtLAAX2n" --endpoint-url http://localhost:8000 --region us-east-1 ./1M-out

  7. Now wait for the cleaning process to kick in (roughly after rgw_d4n_cache_cleaning_interval, which is 600 seconds here)

  8. Check the d4n datacache contents - all dirty entries have now been marked clean (the D_ prefix is removed), which means they have been written to the backend store

  9. Now check get-object with and without a version-id, as in steps 5 and 6.
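
Because list-object-versions does not work yet (see "Things that do NOT work" below), one way to obtain the version-id for step 6 is head-object; a sketch, assuming head-object works through the d4n filter (the version-id also appears in the cache entry file names under rgw_d4n_datacache/):
    aws s3api head-object --bucket my-new-bucket --key 1M --endpoint-url http://localhost:8000 --region us-east-1 --query VersionId --output text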

Testing steps for copy-object:
When both source and destination buckets are non-versioned:

  1. aws s3 mb s3://my-new-bucket --endpoint-url http://localhost:8000 --region us-east-1
  2. aws s3 mb s3://my-bucket --endpoint-url http://localhost:8000 --region us-east-1
  3. aws s3 cp ./1M s3://my-new-bucket --endpoint-url http://localhost:8000 --region us-east-1
  4. aws s3api get-object --bucket my-new-bucket --key 1M --endpoint-url http://localhost:8000 --region us-east-1 ./1M-out
  5. aws s3api copy-object --bucket my-bucket --copy-source my-new-bucket/1M --key 1M-copy --endpoint-url http://localhost:8000 --region us-east-1
  6. check cache contents using ls -l rgw_d4n_datacache/
  7. aws s3api get-object --bucket my-bucket --key 1M-copy --endpoint-url http://localhost:8000 --region us-east-1 ./1M-copy-out
  8. compare md5 of both 1M and 1M-copy-out
  9. Wait for cleaning to kick in
  10. Call get-object as in step 7 to check that the object is fetched correctly from the backend store (see the sketch after this list).
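
A minimal verification sketch for steps 8-10, assuming the same cache path and cleaning interval as above (the output file name 1M-copy-backend is arbitrary):
    md5sum ./1M ./1M-copy-out
    # wait for the cleaner to flush the dirty entries, then re-read the copy backed by the store
    while ls rgw_d4n_datacache/ | grep -q '^D_'; do sleep 30; done
    aws s3api get-object --bucket my-bucket --key 1M-copy --endpoint-url http://localhost:8000 --region us-east-1 ./1M-copy-backend
    md5sum ./1M ./1M-copy-backend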

When the source bucket is versioned:

  1. aws s3 mb s3://my-new-bucket --endpoint-url http://localhost:8000 --region us-east-1
  2. aws s3api put-bucket-versioning --bucket my-new-bucket --versioning-configuration Status=Enabled --endpoint-url http://localhost:8000 --region us-east-1
  3. aws s3 cp ./1M s3://my-new-bucket --endpoint-url http://localhost:8000 --region us-east-1
  4. aws s3 mb s3://my-bucket --endpoint-url http://localhost:8000 --region us-east-1
  5. aws s3api copy-object --bucket my-bucket --copy-source my-new-bucket/1M --key 1M-latest --endpoint-url http://localhost:8000 --region us-east-1
  6. aws s3api get-object --bucket my-bucket --key 1M-latest --endpoint-url http://localhost:8000 --region us-east-1 ./1M-latest-out
  7. md5sum ./1M ./1M-latest-out
  8. wait for the cleaning process to kick in
  9. call get-object as in step 6 to check that the object is read correctly from the backend store.
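
The steps above always copy the latest version of the source object. Copying a specific source version is not covered by these steps (and is not verified for the d4n filter), but the AWS CLI accepts a versionId in --copy-source if that scenario needs to be exercised; <version-id> below is a placeholder:
    aws s3api copy-object --bucket my-bucket --copy-source "my-new-bucket/1M?versionId=<version-id>" --key 1M-version --endpoint-url http://localhost:8000 --region us-east-1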

When the destination bucket is versioned:

  1. aws s3 mb s3://my-new-bucket --endpoint-url http://localhost:8000 --region us-east-1
  2. aws s3 mb s3://my-bucket --endpoint-url http://localhost:8000 --region us-east-1
  3. aws s3api put-bucket-versioning --bucket my-bucket --versioning-configuration Status=Enabled --endpoint-url http://localhost:8000 --region us-east-1
  4. aws s3 cp ./1M s3://my-new-bucket --endpoint-url http://localhost:8000 --region us-east-1
  5. aws s3api copy-object --bucket my-bucket --copy-source my-new-bucket/1M --key 1M-latest --endpoint-url http://localhost:8000 --region us-east-1
  6. aws s3api get-object --bucket my-bucket --key 1M-latest --endpoint-url http://localhost:8000 --region us-east-1 ./1M-latest-out
  7. wait for the cleaning process to kick in
  8. call get-object as in step 6 to check that the object is read correctly from the backend store.
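
Since the destination bucket is versioned here, the copy in step 5 creates a new version of 1M-latest; a sketch to fetch it by version-id, assuming head-object works through the d4n filter (1M-latest-vid is an arbitrary output name):
    vid=$(aws s3api head-object --bucket my-bucket --key 1M-latest --endpoint-url http://localhost:8000 --region us-east-1 --query VersionId --output text)
    aws s3api get-object --bucket my-bucket --key 1M-latest --version-id "$vid" --endpoint-url http://localhost:8000 --region us-east-1 ./1M-latest-vid
    md5sum ./1M ./1M-latest-vid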

Things that do NOT work:

  1. list-object-versions
  2. delete object

Contribution Guidelines

  • To sign and title your commits, please refer to Submitting Patches to Ceph.

  • If you are submitting a fix for a stable branch (e.g. "quincy"), please refer to Submitting Patches to Ceph - Backports for the proper workflow.

  • When filling out the below checklist, you may click boxes directly in the GitHub web UI. When entering or editing the entire PR message in the GitHub web UI editor, you may also select a checklist item by adding an x between the brackets: [x]. Spaces and capitalization matter when checking off items this way.

Checklist

  • Tracker (select at least one)
    • References tracker ticket
    • Very recent bug; references commit where it was introduced
    • New feature (ticket optional)
    • Doc update (no ticket needed)
    • Code cleanup (no ticket needed)
  • Component impact
    • Affects Dashboard, opened tracker ticket
    • Affects Orchestrator, opened tracker ticket
    • No impact that needs to be tracked
  • Documentation (select at least one)
    • Updates relevant documentation
    • No doc update is appropriate
  • Tests (select at least one)
Show available Jenkins commands
  • jenkins retest this please
  • jenkins test classic perf
  • jenkins test crimson perf
  • jenkins test signed
  • jenkins test make check
  • jenkins test make check arm64
  • jenkins test submodules
  • jenkins test dashboard
  • jenkins test dashboard cephadm
  • jenkins test api
  • jenkins test docs
  • jenkins render docs
  • jenkins test ceph-volume all
  • jenkins test ceph-volume tox
  • jenkins test windows
  • jenkins test rook e2e


github-actions bot commented Apr 4, 2024

This pull request can no longer be automatically merged: a rebase is needed and changes have to be manually resolved

pritha-srivastava and others added 3 commits April 22, 2024 13:14
modifications in ReadOp::prepare() method of the d4n filter driver
to cache the head object.

modification in get_obj_attrs to read from cache or backend store.

Signed-off-by: Pritha Srivastava <prsrivas@redhat.com>
Signed-off-by: Samarah <samarah.uriarte@ibm.com>
Signed-off-by: mosayyebzadeh <mosayyeb@bu.edu>
Signed-off-by: mosayyebzadeh <mosayyeb@bu.edu>
Signed-off-by: mosayyebzadeh <mosayyeb@bu.edu>
Signed-off-by: mosayyebzadeh <mosayyeb@bu.edu>
The read process needs to be updated based on the write process. It needs to check where the data is and whether it is dirty or clean.
If it is in the cache and dirty, we need to prefix the object's oid with D_ before reading it from the cache.
If it is clean, there is nothing to do.

Signed-off-by: mosayyebzadeh <mosayyeb@bu.edu>
Signed-off-by: mosayyebzadeh <mosayyeb@bu.edu>
Signed-off-by: Pritha Srivastava <prsrivas@redhat.com>
process.

Signed-off-by: Pritha Srivastava <prsrivas@redhat.com>
Signed-off-by: Pritha Srivastava <prsrivas@redhat.com>
which has objects ordered by their creation time and the top
element of which is fetched in the cleaning method, processed
and deleted in a loop.

Signed-off-by: Pritha Srivastava <prsrivas@redhat.com>
bucket_name_version_object_name_ofs_len, to avoid checks
for versioned and non-versioned objects.

Signed-off-by: Pritha Srivastava <prsrivas@redhat.com>
and delete_obj_attrs() to check if the head object exists in a cache,
else direct the calls to backend store.

Signed-off-by: Pritha Srivastava <prsrivas@redhat.com>
while writing the object.

Signed-off-by: Pritha Srivastava <prsrivas@redhat.com>
@samarahu closed this May 7, 2024
RGWRados, in case ReadOp::prepare() reads the head object from
the cache.

Signed-off-by: Pritha Srivastava <prsrivas@redhat.com>

This pull request can no longer be automatically merged: a rebase is needed and changes have to be manually resolved

pritha-srivastava and others added 16 commits May 20, 2024 15:10
1. storing objects in directory using their oid, so that the version
is included.
2. making sure that the head block corresponds to latest
version in the block directory.
3. add a directory entry for head block for every version
in case of a versioned bucket.
4. Populating hostsList correctly for blocks and objects.

Signed-off-by: Pritha Srivastava <prsrivas@redhat.com>
Signed-off-by: Samarah <samarah.uriarte@ibm.com>
Signed-off-by: Samarah <samarah.uriarte@ibm.com>
Signed-off-by: Samarah <samarah.uriarte@ibm.com>
Signed-off-by: Samarah <samarah.uriarte@ibm.com>
Signed-off-by: Samarah <samarah.uriarte@ibm.com>
…cript

Signed-off-by: Samarah <samarah.uriarte@ibm.com>
data handling and faster completion

Signed-off-by: Samarah <samarah.uriarte@ibm.com>
Signed-off-by: Samarah <samarah.uriarte@ibm.com>
Signed-off-by: Samarah <samarah.uriarte@ibm.com>
Signed-off-by: Samarah <samarah.uriarte@ibm.com>
Signed-off-by: Samarah <samarah.uriarte@ibm.com>
Signed-off-by: Samarah <samarah.uriarte@ibm.com>
…sistent values, and fix directory updates in `cleanup` method

Signed-off-by: Samarah <samarah.uriarte@ibm.com>
Signed-off-by: Pritha Srivastava <prsrivas@redhat.com>
… (LFUDA).

Signed-off-by: Pritha Srivastava <prsrivas@redhat.com>