
RGW-NFS: Use rados cluster_stat to report filesystem usage #20093

Merged
merged 1 commit into ceph:master on Feb 20, 2018

Conversation

@supriti supriti commented Jan 24, 2018

Partially fixes: http://tracker.ceph.com/issues/22202

Signed-off-by: Supriti Singh <supriti.singh@suse.com>

supriti commented Jan 24, 2018

@mattbenjamin please review

@mattbenjamin mattbenjamin left a comment

When I ran this change under gdb, I naively checked the value of stats after executing RGWGetClusterStatReq, and found it to be all-0. I have objects in this cluster; it's over a week old. Can you think of a reason for that?

Thread 1 "ganesha.nfsd" hit Breakpoint 2, rgw_statfs (rgw_fs=<optimized out>, parent_fh=<optimized out>, vfs_st=0x7fffffffd490, flags=<optimized out>)
at /home/mbenjamin/ceph-noob/src/rgw/rgw_file.cc:1630
1630 if (rc < 0) {
(gdb) p rc
$1 = <optimized out>
(gdb) list
1625   RGWLibFS *fs = static_cast<RGWLibFS*>(rgw_fs->fs_private);
1626 struct rados_cluster_stat_t stats;
1627
1628 RGWGetClusterStatReq req(fs->get_context(), fs->get_user(),stats);
1629 int rc = rgwlib.get_fe()->execute_req(&req);
1630 if (rc < 0) {
1631 lderr(fs->get_context()) << "ERROR: getting total cluster usage"
1632 << cpp_strerror(-rc) << dendl;
1633 return rc;
1634 }

(gdb) n
1644 vfs_st->f_bsize = 1 << CEPH_BLOCK_SHIFT;
(gdb) p stats
$3 = {kb = 0, kb_used = 0, kb_avail = 0, num_objects = 0}

vfs_st->f_bavail = UINT64_MAX;
vfs_st->f_files = 1024; /* object count, do we have an est? */
vfs_st->f_ffree = UINT64_MAX;
/*

@mattbenjamin commented inline:

It doesn't seem like we should set a blocksize of 4M as of now; 1M is the most common client default for rwsize. Separately, you have reported problems using this value. In the upcoming nfs writeback changeset, I dimension into 4M extents (== chunks), but subdivide each extent into 1M pages.

* blocks. We use 4MB only because it is big enough, and because it
* actually *is* the (ceph) default block size.
*/
const int CEPH_BLOCK_SHIFT = 22;

@mattbenjamin commented inline:

I have a better source for this constant later (in the extent package); good to keep it at local scope. Can we make it constexpr? Can we make it uint32_t?
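
For illustration, here is a minimal sketch of the stat-to-statvfs mapping under review, assuming rados_cluster_stat_t reports sizes in KiB (as its field names suggest) and using the constexpr uint32_t form suggested above. The helper name fill_statvfs and the exact field choices (e.g. using num_objects for f_files, which the inline comment above asks about) are illustrative, not the merged code:

// Hypothetical sketch: map librados cluster stats onto statvfs-style fields.
// stats.kb and stats.kb_avail are KiB counts (assumption); one 4 MiB block is
// 2^12 KiB, so shifting by (CEPH_BLOCK_SHIFT - 10) converts KiB to blocks.
constexpr uint32_t CEPH_BLOCK_SHIFT = 22;  // 1 << 22 == 4 MiB

static void fill_statvfs(const struct rados_cluster_stat_t& stats,
                         struct rgw_statvfs* vfs_st)
{
  vfs_st->f_bsize  = 1 << CEPH_BLOCK_SHIFT;                      // block size
  vfs_st->f_blocks = stats.kb >> (CEPH_BLOCK_SHIFT - 10);        // total blocks
  vfs_st->f_bfree  = stats.kb_avail >> (CEPH_BLOCK_SHIFT - 10);  // free blocks
  vfs_st->f_bavail = stats.kb_avail >> (CEPH_BLOCK_SHIFT - 10);  // avail to unprivileged
  vfs_st->f_files  = stats.num_objects;  // object count stands in for inodes
  vfs_st->f_ffree  = UINT64_MAX;         // no meaningful inode limit
}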

@@ -1622,16 +1622,31 @@ int rgw_statfs(struct rgw_fs *rgw_fs,
struct rgw_statvfs *vfs_st, uint32_t flags)
{
RGWLibFS *fs = static_cast<RGWLibFS*>(rgw_fs->fs_private);
struct rados_cluster_stat_t stats;

RGWGetClusterStatReq req(fs->get_context(), fs->get_user(),stats);

@mattbenjamin commented inline:

space before stats
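
For reference, the corrected call, as it appears in the updated patch later in this thread, reads:

RGWGetClusterStatReq req(fs->get_context(), fs->get_user(), stats);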

@mattbenjamin mattbenjamin self-assigned this Jan 26, 2018

supriti commented Jan 29, 2018

@mattbenjamin
I was testing using vstart and running a ganesha instance pointing at the vstart ceph.conf. Running "df -h" on the mount point shows the usage.

I also ran with gdb: I started the ganesha process, attached gdb to it, and set a breakpoint at rgw_statfs.
I can see the right stats in the stack trace.
(gdb) l
1619     get filesystem attributes
1620  */
1621  int rgw_statfs(struct rgw_fs *rgw_fs,
1622                 struct rgw_file_handle *parent_fh,
1623                 struct rgw_statvfs *vfs_st, uint32_t flags)
1624  {
1625    RGWLibFS *fs = static_cast<RGWLibFS*>(rgw_fs->fs_private);
1626    struct rados_cluster_stat_t stats;
1627
1628    RGWGetClusterStatReq req(fs->get_context(), fs->get_user(), stats);
(gdb) n
1625    RGWLibFS *fs = static_cast<RGWLibFS*>(rgw_fs->fs_private);
(gdb) n
1628    RGWGetClusterStatReq req(fs->get_context(), fs->get_user(), stats);
(gdb) n
1625    RGWLibFS *fs = static_cast<RGWLibFS*>(rgw_fs->fs_private);
(gdb) n
1628    RGWGetClusterStatReq req(fs->get_context(), fs->get_user(), stats);
(gdb) n
1629    int rc = rgwlib.get_fe()->execute_req(&req);
(gdb) n
1628    RGWGetClusterStatReq req(fs->get_context(), fs->get_user(), stats);
(gdb) p stats
$1 = {kb = 140405592657808, kb_used = 4416878, kb_avail = 0, num_objects = 29362600}
(gdb)

Partially fixes: http://tracker.ceph.com/issues/22202

Signed-off-by: Supriti Singh <supriti.singh@suse.com>

supriti commented Jan 31, 2018

@mattbenjamin addressed your comments and submitted a new patch. Please check.

@mattbenjamin mattbenjamin left a comment

lgtm--I'll re-test later today, should be fine

supriti commented Feb 5, 2018

@mattbenjamin ping. Were you able to test this patch?

@mattbenjamin mattbenjamin commented

@supriti I've retested. In my environment, I continue to see 0 values reported from rados::cluster_stat()--having said that, I see nothing wrong with the logic being executed, and I guess I have to assume my test cluster is reporting 0-stats.

@mattbenjamin mattbenjamin commented

@supriti this works beautifully; note to self: don't forget to run ceph-mgr ;)
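
The note above is the key to the earlier all-zero readings: the cluster usage statistics flow through ceph-mgr, so without a running mgr the values come back as zero. A quick way to check what librados itself reports, independent of ganesha, is a small stand-alone client such as the sketch below; the file name, build line, and client id are illustrative assumptions:

// check_cluster_stat.cc -- print raw librados cluster stats (hypothetical helper).
// Build (paths may vary): g++ check_cluster_stat.cc -lrados -o check_cluster_stat
#include <rados/librados.hpp>
#include <iostream>

int main()
{
  librados::Rados cluster;
  if (cluster.init("admin") < 0 ||           // client id; adjust to your keyring
      cluster.conf_read_file(nullptr) < 0 || // default ceph.conf search path
      cluster.connect() < 0) {
    std::cerr << "failed to connect to cluster" << std::endl;
    return 1;
  }

  librados::cluster_stat_t stats;
  int rc = cluster.cluster_stat(stats);
  if (rc < 0) {
    std::cerr << "cluster_stat failed: " << rc << std::endl;
  } else {
    // All-zero output on a cluster that holds data suggests ceph-mgr
    // is not running, matching the behavior observed in this thread.
    std::cout << "kb=" << stats.kb
              << " kb_used=" << stats.kb_used
              << " kb_avail=" << stats.kb_avail
              << " num_objects=" << stats.num_objects << std::endl;
  }
  cluster.shutdown();
  return rc < 0 ? 1 : 0;
}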

@mattbenjamin mattbenjamin merged commit 11526c6 into ceph:master Feb 20, 2018