Enhance handleops readdirplus operation to return handles along with dirents #174

soumyakoduri · 2017-04-18T07:30:27Z

Any application using glfs handles (like NFS-Ganesha) may need to do lookup on each dirent returned by readdir/readdirp operation to create handles. But since the lookup in glusterfs is very costly operation (as need to be sent to all replica bricks), as the directory gets larger or the replica count increases, the time taken for single readdir operation to complete can take hours of time. To avoid that we need an extended readdirp API which can return handles along with dirent stat as part of its reply.

ShyamsundarR · 2017-04-26T14:49:03Z

This is being tracked for a week later backport to 3.11 post branching! (May-05)

@todo

Its known that readdirplus operation fetches stat as well for each of the dirents. But often applications may need extra information, like for eg., NFS-Ganesha which operates on handles needs handles for each of those dirents returned. So this would require extra calls to the backend, in this case LOOKUP (which is very expensive operation) resulting in very low readdir performance. To address that introducing this new API using which applications can make request for any extra information to be returned as part of readdirplus response. Currently this new api returns stat and handles as demanded by application. The synopsis of the API is noted in glfs.h. @todo: * Enhance test script using this new API Below were the perf results on single brick volume with and without these changes - Dataset used - 10*100 directories and each directory containing 100 empty files. I used NFS-Ganesha application to test these changes - >for i in {1..5}; do systemctl restart nfs-ganesha; sleep 10; mount -t nfs -o vers=4 localhost:/brick_vol /mnt; cd /mnt; echo "ITERATION$i"; date; find . > tmp-nfs.log; date; cd /; umount /mnt; sleep 2; done; Without these changes - ITERATION1 Mon Mar 20 17:22:26 IST 2017 Mon Mar 20 17:23:18 IST 2017 ITERATION2 Mon Mar 20 17:23:39 IST 2017 Mon Mar 20 17:24:28 IST 2017 ITERATION3 Mon Mar 20 17:24:49 IST 2017 Mon Mar 20 17:25:36 IST 2017 ITERATION4 Mon Mar 20 17:30:57 IST 2017 Mon Mar 20 17:31:37 IST 2017 ITERATION5 Mon Mar 20 17:31:57 IST 2017 Mon Mar 20 17:32:40 IST 2017 [root@dhcp35-197 /]# On an average ~46.2 sec With these changes applied - ITERATION1 Mon Mar 20 17:35:03 IST 2017 Mon Mar 20 17:35:15 IST 2017 ITERATION2 Mon Mar 20 17:35:36 IST 2017 Mon Mar 20 17:35:46 IST 2017 ITERATION3 Mon Mar 20 17:36:06 IST 2017 Mon Mar 20 17:36:17 IST 2017 ITERATION4 Mon Mar 20 17:41:38 IST 2017 Mon Mar 20 17:41:49 IST 2017 ITERATION5 Mon Mar 20 17:42:10 IST 2017 Mon Mar 20 17:42:20 IST 2017 On an average ~10.8 sec Updates #174 BUG: 1442950 Change-Id: I0f74f74dc62085ca4c4a23c38e3edc84bd850876 Signed-off-by: Soumya Koduri <skoduri@redhat.com> Reviewed-on: https://review.gluster.org/15663 Smoke: Gluster Build System <jenkins@build.gluster.org> NetBSD-regression: NetBSD Build System <jenkins@build.gluster.org> Reviewed-by: Niels de Vos <ndevos@redhat.com> CentOS-regression: Gluster Build System <jenkins@build.gluster.org>

@todo

Its known that readdirplus operation fetches stat as well for each of the dirents. But often applications may need extra information, like for eg., NFS-Ganesha which operates on handles needs handles for each of those dirents returned. So this would require extra calls to the backend, in this case LOOKUP (which is very expensive operation) resulting in very low readdir performance. To address that introducing this new API using which applications can make request for any extra information to be returned as part of readdirplus response. Currently this new api returns stat and handles as demanded by application. The synopsis of the API is noted in glfs.h. @todo: * Enhance test script using this new API Below were the perf results on single brick volume with and without these changes - Dataset used - 10*100 directories and each directory containing 100 empty files. I used NFS-Ganesha application to test these changes - >for i in {1..5}; do systemctl restart nfs-ganesha; sleep 10; mount -t nfs -o vers=4 localhost:/brick_vol /mnt; cd /mnt; echo "ITERATION$i"; date; find . > tmp-nfs.log; date; cd /; umount /mnt; sleep 2; done; Without these changes - ITERATION1 Mon Mar 20 17:22:26 IST 2017 Mon Mar 20 17:23:18 IST 2017 ITERATION2 Mon Mar 20 17:23:39 IST 2017 Mon Mar 20 17:24:28 IST 2017 ITERATION3 Mon Mar 20 17:24:49 IST 2017 Mon Mar 20 17:25:36 IST 2017 ITERATION4 Mon Mar 20 17:30:57 IST 2017 Mon Mar 20 17:31:37 IST 2017 ITERATION5 Mon Mar 20 17:31:57 IST 2017 Mon Mar 20 17:32:40 IST 2017 [root@dhcp35-197 /]# On an average ~46.2 sec With these changes applied - ITERATION1 Mon Mar 20 17:35:03 IST 2017 Mon Mar 20 17:35:15 IST 2017 ITERATION2 Mon Mar 20 17:35:36 IST 2017 Mon Mar 20 17:35:46 IST 2017 ITERATION3 Mon Mar 20 17:36:06 IST 2017 Mon Mar 20 17:36:17 IST 2017 ITERATION4 Mon Mar 20 17:41:38 IST 2017 Mon Mar 20 17:41:49 IST 2017 ITERATION5 Mon Mar 20 17:42:10 IST 2017 Mon Mar 20 17:42:20 IST 2017 On an average ~10.8 sec This is backport of below upstream patch - https://review.gluster.org/15663 >Updates #174 >BUG: 1442950 >Change-Id: I0f74f74dc62085ca4c4a23c38e3edc84bd850876 >Signed-off-by: Soumya Koduri <skoduri@redhat.com> >Reviewed-on: https://review.gluster.org/15663 >Smoke: Gluster Build System <jenkins@build.gluster.org> >NetBSD-regression: NetBSD Build System <jenkins@build.gluster.org> >Reviewed-by: Niels de Vos <ndevos@redhat.com> >CentOS-regression: Gluster Build System <jenkins@build.gluster.org> BUG: 1447571 Change-Id: I0f74f74dc62085ca4c4a23c38e3edc84bd850876 Signed-off-by: Soumya Koduri <skoduri@redhat.com> Reviewed-on: https://review.gluster.org/17164 Smoke: Gluster Build System <jenkins@build.gluster.org> NetBSD-regression: NetBSD Build System <jenkins@build.gluster.org> CentOS-regression: Gluster Build System <jenkins@build.gluster.org> Reviewed-by: Niels de Vos <ndevos@redhat.com>

gluster-ant · 2017-05-23T13:50:02Z

A patch https://review.gluster.org/17372 has been posted that references this issue.
Commit message: Update release notes of 3.11 for readdirplus enhancements

Updates: gluster#174 Change-Id: I006cafc622c9ee2d776556c287e3ed1743ffa84b Signed-off-by: Soumya Koduri <skoduri@redhat.com>

gluster-ant · 2017-05-24T07:33:56Z

A patch https://review.gluster.org/17372 has been posted that references this issue.
Commit message: Update release notes of 3.11 for readdirplus enhancements

gluster-ant · 2017-05-24T07:39:24Z

A patch https://review.gluster.org/17372 has been posted that references this issue.
Commit message: Update release notes of 3.11 for readdirplus enhancements

gluster-ant · 2017-05-24T09:00:18Z

A patch https://review.gluster.org/17372 has been posted that references this issue.
Commit message: Update release notes of 3.11 for readdirplus enhancements

Updates: #174 Change-Id: I006cafc622c9ee2d776556c287e3ed1743ffa84b Signed-off-by: Soumya Koduri <skoduri@redhat.com> Reviewed-on: https://review.gluster.org/17372 CentOS-regression: Gluster Build System <jenkins@build.gluster.org> NetBSD-regression: NetBSD Build System <jenkins@build.gluster.org> Smoke: Gluster Build System <jenkins@build.gluster.org> Reviewed-by: Shyamsundar Ranganathan <srangana@redhat.com>

ShyamsundarR · 2017-05-25T20:20:01Z

All artifacts for this feature are in, code + documentation. Closing the issue.

As a pointer for the future, when submitting the last artifact for the issue, use fixes #n in the commit message to auto close the issue.

ShyamsundarR added CB: gfapi FA: Developer Experience FA: Performance Improvements labels Apr 26, 2017

ShyamsundarR added this to the Release 3.11 (STM) milestone Apr 26, 2017

ShyamsundarR added this to Release 3.11 in Releases Apr 26, 2017

soumyakoduri added a commit to soumyakoduri/glusterfs that referenced this issue May 24, 2017

Update release notes of 3.11 for readdirplus enhancements

233ee7e

Updates: gluster#174 Change-Id: I006cafc622c9ee2d776556c287e3ed1743ffa84b Signed-off-by: Soumya Koduri <skoduri@redhat.com>

ShyamsundarR closed this as completed May 25, 2017

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Enhance handleops readdirplus operation to return handles along with dirents #174

Enhance handleops readdirplus operation to return handles along with dirents #174

soumyakoduri commented Apr 18, 2017

ShyamsundarR commented Apr 26, 2017

gluster-ant commented May 23, 2017

gluster-ant commented May 24, 2017

gluster-ant commented May 24, 2017

gluster-ant commented May 24, 2017

ShyamsundarR commented May 25, 2017

Enhance handleops readdirplus operation to return handles along with dirents #174

Enhance handleops readdirplus operation to return handles along with dirents #174

Comments

soumyakoduri commented Apr 18, 2017

ShyamsundarR commented Apr 26, 2017

gluster-ant commented May 23, 2017

gluster-ant commented May 24, 2017

gluster-ant commented May 24, 2017

gluster-ant commented May 24, 2017

ShyamsundarR commented May 25, 2017